Data Science and Statistics

From beginner to advanced, resources below can help you improve your data science/statistics skills.

External Resources

Snakemake Tutorial - A workflow manager for advanced coders to improve reproducibility and sharing of your work.

EMBL-EBI- Train modules for Scientists - Train online provides free courses on Europe's most widely used data resources, created by experts at EMBL-EBI and collaborating institutes. You do not need to have any previous experience of bioinformatics to benefit from this training. 

Comprehensive set of data science from beginning to intermediate to advanced courses -  All kinds of data science courses

How to code- basicsFor learning how to code in general, they have classes to learn different coding languages/coding skills.

Learning the basic coding in different languages Basics to learn different coding language with help from tutors.

High-Dimensional Data AnalysisIf you’re interested in data analysis and interpretation, then this is the data science course for you. 

Data Science SpecializationThis Specialization covers the concepts and tools you'll need throughout the entire data science pipeline, from asking the right kinds of questions to making inferences and publishing results.

Computing in Python I: Fundamentals and Procedural ProgrammingThis course starts from the beginning, covering the basics of how a computer interprets lines of code; how to write programs, evaluate their output, and revise the code itself; how to work with variables and their changing values; and how to use mathematical, boolean, and relational operators. By the end of this course, you'll be able to write small programs in Python that use variables, mathematical operators, and logical operators.

Using Python for ResearchThis course bridges the gap between introductory and advanced courses in Python. In this course, after first reviewing the basics of Python 3, we learn about tools commonly used in research settings. This version of the course includes a new module on statistical learning.

Learn Data Science OnlineWith DataCamp, you learn data science today and apply it tomorrow.

Best Practices in Programming for Reproducibility In ScienceInternal MSK course

Quantitative Genomics and Genetics Course (Tri-I)Taught by Jason Mezey from Weill Cornell

Biology Meets Programming: Bioinformatics for BeginnersThis course will cover algorithms for solving various biological problems along with a handful of programming challenges helping you implement these algorithms in Python.

R for Data ScienceA book-course here in an online form by Hadley Wickham himself and Garret Grolemund. This book will teach you how to do data science with R.

Software CarpentryLessons created collaboratively 

Rosalind (bioinformatics)a platform for learning bioinformatics and programming through problem solving

Genomic Data Science in Coursera - A series of free courses on how to handle NGS data

Bioconductor (courses)provides training in computational and statistical methods for the analysis of genomic data

Python for microscopylearn to start coding

Napari Napari is a fast, interactive, multi-dimensional image viewer for Python. Here we’ve provided a few tutorials to explore the main usage modes and methods of napari.

Basics of Image processing and analysis in ImageJ Material (lectures and assignments) that is used in a (very) basic course on image analysis and image processing, with a focus on images acquired from cells with fluorescence microscopy. It is centered around the use of the freeware software ImageJ/FIJI

Bio Image AnalysisRecommendations on Bioimage Analysis tools and tutorials

Statistics and RHarvard University edX

Statistics for Laboratory Scientists IJohns Hopkins University OpenCourseWare

Statistics and MathHoward Hughes Medical Institute BioInteractive

Data Science SpecializationJohns Hopkins University - Coursera

Statistical Analysis of Research Data (SARD)National Cancer Institute Center for Cancer Research

Introduction to BiostatisticsYale School of Medicine Yale Center for Clinical Investigation

Seeing Theory A visual introduction to probability and statistics

Modern Statistics for Modern BiologyThis is for people with the good understanding of basics of R.

Adobe Illustrator for Scientists A series of tutorials on using Illustrator, that's targeted at the tools and methods that are most likely to be used by scientists by Kevin Bonham.