
Explore feature selection for tabular data using Python, covering filter and embedded methods, mutual information, backward elimination, and random forest guidance with practical code examples.
Explore feature selection for tabular data in Python, focusing on selecting features that affect the target variable and comparing three methods, including an embedded method.
The lecture introduces filter methods for feature selection, scoring each column and selecting features based on mutual information with the target for fast processing of tabular data.
Apply variance-based feature selection to a Python regression dataset, scale features, and evaluate with mean absolute error. Compare thresholds to show how dropping low-variance features affects model performance.
Implement chi2 for feature selection in a Python tabular data workflow, build and train a random forest classifier, and assess model accuracy on training and testing data.
Learn how forward feature selection in mlxtend builds feature subsets for tabular data, optimizing model accuracy in classification by evaluating sequentially added features.
In Project 12, implement backward feature elimination for a tabular dataset using Python, split data into training and testing sets, and iteratively remove features to evaluate classification accuracy.
Explore tree-based methods for embedded feature selection using Python, including random forest, gradient boosting, and decision trees, and learn how to derive feature importance to select key attributes.
Explore regularization methods for feature selection in Python for tabular data, focusing on lasso and elastic net, which shrink or drop coefficients to remove non-significant features and prevent overfitting.
Implement logistic regression with lasso regularization to perform feature selection on tabular data, fit on training data, predict on test data, and evaluate with a confusion matrix.
Explore the benefits of embedded feature selection, where selection occurs during model training, offering simplicity, no extra processing time, and often greater accuracy than filter methods.
Feature selection is one of most important activity in machine learning/Artificial Intelligence pipeline. We select all relevant features for machine learning algorithm and discard less relevant or not relevant features. Feature selection is also known as variable selection.This course will provide learner, detailed knowledge of feature selection. It is one of most detailed online course on feature selection.
Who is this course for ?
Data scientist who wants to create faster and more interpretable machine learning models.
Data analyst who wants to relation between two variables.
Data science aspirants who are preparing for data science interview.
Any One who wants to learn about feature selection process.
AI/ML software engineer who write code for machine learning.
Teachers who are teaching Machine Learning Models.
What will you learn ?
In this course, you are going to learn feature selection by doing. I have included more than 8 end to end small projects on feature selection methods. Each method has one project so that learner can understand the process fully. Code provided in throughout course is downloadable. You can download code and data and run by yourself to get confidence. Knowledge gain though this course is precious and can be used in We are going to learn following topics.
What is feature selection?
Different methods of feature selection.
Filter methods
Minimum variance method
F-Score using correlation for regression analysis data.
Anova F for classification analysis data
Mutual Information for regression and Classification analysis data.
Chi-Square Scores for categorical features and Target
All these methods implementation using sklearn
Wrapper Method
Forward selection of features.
Backward selection of features.
Exhaustive feature selection.
Implementation of each using sklearn and mlxtend.
Embedded Method
Introduction to Embedded Method for feature selection.
Using RandomForest
Using Extremely randomized trees to select features
Regularization based feature selection
So what are you waiting for? Join the course and get the knowledge of variable selection and apply it in your projects to get efficient and interpretable machine learning models.