AI and Data Scientist Roadmap¶
- Roadmap: https://roadmap.sh/ai-data-scientist
1. Mathematics¶
1.1 Linear Algebra, Calculus, Mathematical Analysis¶
- 1.1.1 Mathematics for Machine Learning (Courses)
1.2 Differential Calculus¶
- 1.2.1 Coursera: Algebra and Differential Calculus (Course)
2. Statistics¶
2.1 Statistics, CLT¶
- 2.1.1 Coursera: Introduction to Statistics (Course)
2.2 Hypothesis Testing¶
- 2.2.1 Coursera: Hypothesis Testing (Course)
2.3 Probability and Sampling¶
- 2.3.1 Coursera: Probability and Statistics (Course)
2.4 AB Testing¶
- 2.4.1 Practitioner's Guide to Statistical Tests (Article)
- 2.4.2 Experiment Design Article (Article)
2.5 Increasing Test Sensitivity¶
- 2.5.1 Minimum Detectable Effect (Article)
- 2.5.2 Paper: Improving Test Sensitivity (Paper)
- 2.5.3 Paper: Improving Sensitivity (CUPED) (Paper)
- 2.5.4 CUPED at Booking.com (Article)
- 2.5.5 Doordash: CUPAC (Article)
- 2.5.6 Netflix: Stratification (Paper)
2.6 Ratio Metrics¶
- 2.6.1 Microsoft: Delta Method in Metric Analytics (Paper)
- 2.6.2 Paper: Ratio Metrics (Paper)
3. Econometrics¶
3.1 Pre-requisites of Econometrics¶
- 3.1.1 Fundamentals of Econometrics (Book)
3.2 Regression, Timeseries, Fitting Distributions¶
- 3.2.1 Intro to Econometrics (Book)
- 3.2.2 Coursera: Econometrics (Course)
- 3.2.3 Kaggle: Learn Time Series (Course)
- 3.2.4 Kaggle: Time Series Basics (Tutorial)
- 3.2.5 ARIMA model for Time Series (Tutorial)
- 3.2.6 Time Series Models (Tutorial)
- 3.2.7 Forecasting Task with Solution (OpenSource)
- 3.2.8 Coursera: Linear Regression (Course)
4. Coding¶
4.1 Learn Python Programming Language¶
- 4.1.1 Learn Python: Kaggle (Course)
- 4.1.2 Google's Python Class (Course)
4.2 Data Structures and Algorithms (Python)¶
- 4.2.1 Algorithmic Exercises (Tutorial + Challenges)
- 4.2.2 Study Plans - Leetcode (Challenges)
- 4.2.3 Algorithms Specialization (Course)
4.3 Learn SQL¶
- 4.3.1 SQL Tutorial (Course)
5. Exploratory Data Analysis¶
5.1 Data understanding, Data Analysis and Visualization¶
- 5.1.1 Exploratory Data Analysis with Python and Pandas (Course)
- 5.1.2 Exploratory Data Analysis for Machine Learning (Course)
- 5.1.3 Exploratory Data Analysis with Seaborn (Course)
6. Machine Learning¶
6.1 Classic ML (Sup., Unsup.), Advanced ML (Ensembles, NNs)¶
- 6.1.1 Open Machine Learning Course - Open Data Science (Course)
- 6.1.2 Machine Learning Specialization (Course)
- 6.1.3 Pattern Recognition & ML by Christopher m. Bishop (eBook)
- 6.1.4 GitHub repository with notes & code from the eBook above
7. Deep Learning¶
7.1 Fully Connected, CNN, RNN, LSTM, Transformers, TL¶
- 7.1.1 Deep Learning Specialization (Courses)
- 7.1.2 Deep Learning Book (eBook)
- 7.1.3 Attention is all you need (Paper)
- 7.1.4 The Illustrated Transformer (Article)
8. MLOps¶
8.1 Deployment Models, CI/CD¶
- 8.1.1 MLOps Specialization (Courses)