Teach on Udemy

Turn what you know into an opportunity and reach millions around the world.

Learn More

Your cart is empty.

Keep shopping

Feature Selection for Machine Learning

Name: Feature Selection for Machine Learning
Rating: 4.5 (2384 reviews)

Learn filter, wrapper, and embedded methods, recursive feature elimination, exhaustive search, feature shuffling & more.

Created bySoledad Galli, Train in Data Team

Last updated 3/2025

English

What you'll learn

Learn about filter, embedded and wrapper methods for feature selection
Find out about hybdrid methods for feature selection
Select features with Lasso and decision trees
Implement different methods of feature selection with Python
Learn why less (features) is more
Reduce the feature space in a dataset
Build simpler, faster and more reliable machine learning models
Analyse and understand the selected features
Discover feature selection techniques used in data science competitions

Course content

12 sections • 88 lectures • 5h 50m total length

Course Curriculum Overview3:33
Course requirements3:00
Course Aim1:44
Optional: How to approach this course1:00
Course Material2:01
The code | Jupyter notebooks0:33
Presentations covered in this course0:19
Download the data sets0:38
Resources to learn machine learning skills0:24

Filter Methods with other metrics3:04
Univariate model performance metrics5:52
Univariate model performance metrics | Demo4:23
KDD 2009: Select features by target mean encoding6:39
KDD 2009: Select features by mean encoding | Demo6:59
Learn how to perform mean encoding based feature selection for the Titanic dataset using KDD 2009, encoding categorical variables with target means, binning ages and fares, and evaluating with ROC-AUC.
Univariate model performance with Feature-engine4:54
Target Mean Encoding Selection with Feature-engine5:20
? Unveiling the Dark Side of Algorithms: A Captivating Book Recommendation!0:17

Requirements

A Python installation
Jupyter notebook installation
Python coding skills
Some experience with Numpy and Pandas
Familiarity with Machine Learning algorithms
Familiarity with scikit-learn

Description

Welcome to Feature Selection for Machine Learning, the most comprehensive course on feature selection available online.

In this course, you will learn how to select the variables in your data set and build simpler, faster, more reliable and more interpretable machine learning models.

Who is this course for?

You’ve given your first steps into data science, you know the most commonly used machine learning models, you probably built a few linear regression or decision tree based models. You are familiar with data pre-processing techniques like removing missing data, transforming variables, encoding categorical variables. At this stage you’ve probably realized that many data sets contain an enormous amount of features, and some of them are identical or very similar, some of them are not predictive at all, and for some others it is harder to say.

You wonder how you can go about to find the most predictive features. Which ones are OK to keep and which ones could you do without? You also wonder how to code the methods in a professional manner. Probably you did your online search and found out that there is not much around there about feature selection. So you start to wonder: how are things really done in tech companies?

This course will help you! This is the most comprehensive online course in variable selection. You will learn a huge variety of feature selection procedures used worldwide in different organizations and in data science competitions, to select the most predictive features.

What will you learn?

I have put together a fantastic collection of feature selection techniques, based on scientific articles, data science competitions and of course my own experience as a data scientist.

Specifically, you will learn:

How to remove features with low variance
How to identify redundant features
How to select features based on statistical tests
How to select features based on changes in model performance
How to find predictive features based on importance attributed by models
How to code procedures elegantly and in a professional manner
How to leverage the power of existing Python libraries for feature selection

Throughout the course, you are going to learn multiple techniques for each of the mentioned tasks, and you will learn to implement these techniques in an elegant, efficient, and professional manner, using Python, Scikit-learn, pandas and mlxtend.

At the end of the course, you will have a variety of tools to select and compare different feature subsets and identify the ones that returns the simplest, yet most predictive machine learning model. This will allow you to minimize the time to put your predictive models into production.

This comprehensive feature selection course includes about 70 lectures spanning ~8 hours of video, and ALL topics include hands-on Python code examples which you can use for reference and for practice, and re-use in your own projects.

In addition, I update the course regularly, to keep up with the Python libraries new releases and include new techniques when they appear.

So what are you waiting for? Enroll today, embrace the power of feature selection and build simpler, faster and more reliable machine learning models.

Who this course is for:

Beginner Data Scientists who want to understand how to select variables for machine learning
Intermediate Data Scientists who want to level up their experience in feature selection for machine learning
Advanced Data Scientists who want to discover alternative methods for feature selection
Software engineers and academics switching careers into data science
Software engineers and academics stepping into data science
Data analysts who want to level up their skills in data science

Feature Selection for Machine Learning

What you'll learn

Explore related topics

Course content

Introduction9 lectures • 13min

Feature Selection7 lectures • 33min

Filter Methods | Basics7 lectures • 34min

Filter methods | Correlation9 lectures • 35min

Filter methods | Statistical measures12 lectures • 1hr 13min

Filter Methods | Other methods and metrics8 lectures • 37min

Wrapper methods11 lectures • 36min

Embedded methods | Linear models5 lectures • 19min

Embedded methods – Lasso regularisation4 lectures • 13min

Embedded methods | Trees4 lectures • 16min

Requirements

Description

Who this course is for: