Teach on Udemy

Turn what you know into an opportunity and reach millions around the world.

Learn More

Your cart is empty.

Keep shopping

Machine Learning for Aspiring Data Scientists: Zero to Hero

Name: Machine Learning for Aspiring Data Scientists: Zero to Hero
Rating: 4.8 (69 reviews)

Learn the foundations of machine learning to get a job in data science. No coding experience required.

Created byEmmanuel Maggiori

Last updated 5/2023

English

What you'll learn

Undertand the foundations of machine learning even if you're a total beginner
Be able to pass the typical machine learning interviews for data science jobs
Avoid rookie mistakes that waste companies' time and money
Learn machine learning without spending time on mathematical proofs and outdated methods that don't come up in interviews or work.
Build machine learning models with Python and Scikit-Learn
Understand linear regression, neural networks, random forest, gradient boosting, support vector machines

Course content

15 sections • 218 lectures • 16h 7m total length

Modeling an epidemic7:45
The machine learning recipe5:50
Quick exercise on machine learning recipe
The components of a machine learning model1:55
Why model?3:04
Fit a model to data to forecast future epidemic evolution beyond observed data. Emphasize extrapolation with trust in the exponential assumptions to guide decisions such as interventions or recommendations.
On assumptions and can we get rid of them?8:51
The case of AlphaZero11:18
AlphaZero learns move value from simulated games using a human-designed template, showing that human-encoded template assumptions drive performance.
Overfitting/underfitting/bias/variance11:12
Why use machine learning4:33
Uncover why to use machine learning: automate parameter search for complex models, let data drive decisions, and reveal new insights you’d miss, with examples from recommendations and image analysis.
Notes on machine learning models0:05
Summary notes of this lecture.
Quiz on machine learning models

The InsureMe challenge5:41
Explore linear regression for predicting continuous insurance claims in a real-world Insure Me scenario. Learn how X features map to Y claims through training on historical data.
Supervised learning5:20
Learn how supervised learning uses true labeled data during training to predict claims from client features, and how inference uses the trained model to estimate future claims quickly.
A quick note on the word "features"0:19
Linear assumption3:15
Linear regression template6:52
Non-linear vs proportional vs linear4:49
Explore linear, nonlinear, and proportional relations with real-world examples—car rental thresholds, tax structures, and commissions—and connect these to linear regression concepts and the intercept.
Linear regression template revisited3:51
Loss function8:19
Training algorithm8:19
Code time14:33
R squared5:41
Why use a linear model?3:41
Explore when to use a linear model by weighing assumptions, opportunities for insights from coefficients, and the value of explainability, while noting limits and where transformations may help.
Kaggle notebook on linear regression0:07
5-minute code assignment
Notes on supervised learning and linear regression0:05
Quiz on supervised learning and linear regression
Finding closed-form solution to linear regression (optional)0:06

Introduction to scaling5:52
Min-max scaling3:01
Code time (min-max scaling)9:16
Explore min-max scaling with pandas to scale each feature independently, train a linear regression on the scaled data, and compare outcomes with mean squared error.
The problem with min-max scaling2:50
Assess the problems of min-max scaling, notably its distortion by outliers and keeping targets unscaled, then consider alternative input scaling approaches for better interpretation.
What's your IQ?11:11
Standard scaling4:12
Code time (standard scaling)2:24
Model before and after scaling5:15
Inference time6:56
During inference, scale new inputs with the training data's mean and standard deviation, not row-specific values, and remember these statistics to ensure consistent predictions.
Pipelines2:53
Code time (pipelines)4:52
Kaggle notebook on scaling and pipelines0:06
Notes on scaling and pipelines0:05
Quiz on scaling and pipelines

Spurious correlations3:42
L2 regularization10:13
Code time (L2 regularization)5:19
L2 results2:28
L1 regularization5:52
Code time (L1 regularization)3:41
L1 results2:02
Drive L1 regularized linear regression with varying alphas to zero out some features, discarding phone numbers and house size, while validation seeks a sweet spot to balance error and overfitting.
Why does L1 encourage zeros?8:50
L1 vs L2: Which one is best?1:24
Kaggle notebook on regularization0:06
5-minute code exercise
Notes on regularization0:05
Quiz on regularization

Introduction to validation2:04
Why not evaluate model on training data5:34
Learn why evaluating models on training data misleads about performance, and use held-out data to compare regularized and unregularized models, guarding against overfitting and improving generalization.
The validation set4:51
Split data into training, validation, and test sets; train a pipeline on the training data, scale with the training mean and std, and assess out-of-sample generalization on the validation set.
Code time (validation set)7:36
Error curves8:20
Model selection5:43
Use the validation set to perform hyperparameter tuning across model templates, comparing L1 and L2 regularization with different alphas via grid search, to select the best model and estimate generalization.
The problem with model selection5:39
Tainted validation set4:48
Monkeys with typewriters2:32
My own validation epic fail7:24
The test set5:30
What if the model doesn't pass the test?5:13
How not to be fooled by randomness2:11
Design experiments in advance and split data into training and validation sets to minimize being fooled by randomness, guided by the no free lunch theorem and cross-validation to estimate performance.
Cross-validation3:37
Use cross-validation with multiple folds (typically five or ten) to train models in Python and average results to reduce variance. Reserve a test set at the end for model selection.
Code time (cross validation)6:35
2-minute cross-validation exercise
Cross-validation results summary2:08
AutoML5:20
Explore AutoML with h2o, which automates model selection by trying multiple templates, ensembles, and fivefold cross-validated performance, and even generates new features by multiplying feature pairs.
Is AutoML a good idea?5:27
Red flags: Don't do this!6:55
Red flags summary and what to do instead4:42
Your job as a data scientist3:23
Kaggle notebook on validation and cross-validation0:04
30-minute code assignment with new dataset!0:22
Notes on validation and testing0:02
Extra reading: Model retraining0:04
Extra reading: The Difference between Statistics and Machine Learning0:04

Intro and recap1:48
Mistake #1: Data leakage5:25
The golden rule4:06
Helpful trick (feature importance)2:11
Use feature importance after training to spot data leakage and assess if the model relies on suspicious inputs, like basket size, via linear model coefficients.
Real example of data leakage (part 1)4:30
Real example of data leakage (part 2)5:12
Another (funny) example of data leakage2:17
Mistake #2: Random split of dependent data4:52
Another example (insurance data)4:35
Mistake #3: Look-Ahead Bias6:11
Identify look-ahead bias as a common pitfall where future information leaks into predictions. Avoid aggregated time-series features that reveal November prices early, which cheats the model’s training and deployment.
Example solutions to Look-Ahead Bias1:38
Explore two look-ahead bias solutions: using the previous year's monthly average to capture seasonality, and applying a 30-day rolling average before the purchase date.
Consequences of Look-Ahead Bias2:25
How to split data to avoid Look-Ahead Bias3:14
Cross-validation with temporally related data2:51
Master temporally aware cross-validation for time-series data by training on prior periods and validating on later months with rolling or sliding windows, for robust model selection and tuning.
Mistake #4: Building model for one thing, using it for something else4:07
Sketchy rationale5:41
Learn how predicting purchases does not prove a model understands price sensitivity, and why causal inference and counterfactuals require separate validation before using a model for new tasks.
Why this matters for your career and job search3:35
Find the error: 10-minute code assignment0:12
Assignment solution0:50
Notes on common mistakes0:03
Quiz on common mistakes

Classifying images of handwritten digits6:42
Why the usual regression doesn't work4:24
Machine learning recipe recap1:47
Logistic model template (binary)12:38
Decision function and boundary (binary)4:48
Logistic model template (multiclass)13:55
Decision function and boundary (multi-class)1:26
Summary: binary vs multiclass1:22
Learn why binary classification uses a single output and deduces the other class, while multiclass requires K outputs and a self max function to combine probabilities.
Code time!20:06
Why the logistic model is often called logistic regression5:21
One vs Rest, One vs One5:14
Kaggle notebook on logistic model for digit classification0:04
5-minute coding task: How smart is this model?
Notes on Logistic Model0:03
Quiz on Logistic Model

Recap2:48
No closed-form solution2:10
In logistic model, there is no closed form solution to obtain optimal parameters; instead evaluate the cross entropy loss for parameter sets with a trial and error approach.
Naive algorithm3:46
Fog analogy5:15
Gradient descent overview2:56
The gradient6:24
Learn to compute the gradient of the loss as a vector of partial derivatives across all model parameters, guiding the fastest ascent or descent toward optimal weights and biases.
Numerical calculation2:02
Parameter update4:06
Convergence2:30
Analytical solution2:35
[Optional] Interpreting analytical solution4:51
Gradient descent conditions3:06
Beyond vanilla gradient descent2:49
Code time7:19
Reading the documentation10:45
10-minute coding exercise: Classify images of clothes0:08
Notes on Gradient Descent0:03

Binary classification and class imbalance5:51
Assessing performance4:00
Accuracy6:55
Accuracy with different class importance3:46
Precision and Recall7:18
Sensitivity and Specificity3:19
Explore sensitivity and specificity in binary classification, including how many positives and negatives are correctly identified, and the tradeoff with recall, precision, and thresholds.
F-measure and other combined metrics4:37
ROC curve7:04
Area under the ROC curve6:19
Custom metric (important stuff!)6:18
Other custom metrics3:26
Bad data science process :(3:51
Data rebalancing (avoid doing this!)5:41
Stratified split3:12
Notes on Classification Metrics0:03
Quiz on Classification Metrics

Requirements

No programming or advanced math experience required! You'll learn everything you need to know.

Description

This course will teach you the foundations of machine learning. The content was especially designed to help you pass machine learning interviews for data science jobs.

The course will help you:

Pass job interviews and technical quizzes
Avoid rookie mistakes that waste companies' time and money
Be prepared for real work.

Important stuff about this course:

You won't spend hours learning stuff that never comes up in a job interview.
Total beginners are welcome; coding experience or advanced math knowledge are not required.
It was designed by an industry expert who's been on the hiring side of the table and knows what companies are looking for.

This course will be of great help if you are:

A student who wants to prepare for work in data science after graduating.
An established professional or academic who wants to switch careers to data science.
A total beginner who wants to dabble in machine learning and data science for the first time.

How is this different from an academic course or a bootcamp?

In academic courses, your teacher spends hours speaking about calculus and linear algebra, but then none of that comes up in a job interview! That in-depth knowledge certainly has a place but is not what most companies are looking for.

In bootcamps you tend to learn how to use many tools but not how they work under the hood. This black-box knowledge is what companies want to avoid the most in applicants!

This course sits in between—you gain foundational knowledge and truly understand machine learning, without spending time on unimportant stuff.

Who this course is for:

Aspiring data scientists who want to get their first job in the field.
Software engineers who want to be involved in data science and machine learning.
Researchers who want to make the move from academia to industry.
Computer science graduates who want to dabble in data science.

Machine Learning for Aspiring Data Scientists: Zero to Hero

What you'll learn

Explore related topics

Course content

Machine Learning Models9 lectures • 55min

Linear regression15 lectures • 1hr 11min

Scaling and Pipelines13 lectures • 59min

Regularization11 lectures • 44min

Validation26 lectures • 1hr 46min

Common Mistakes20 lectures • 1hr 6min

Classification - Part 1: Logistic Model13 lectures • 1hr 18min

Classification - Part 2: Maximum Likelihood Estimation9 lectures • 41min

Classification - Part 3: Gradient Descent17 lectures • 1hr 4min

Classification metrics and class imbalance15 lectures • 1hr 12min

Requirements

Description

Who this course is for: