Name: ChatGPT for Data Science and Machine Learning
Rating: 4.4 (59 reviews)

Udemy Business

Teach on Udemy

Turn what you know into an opportunity and reach millions around the world.

Learn More

Your cart is empty.

Keep shopping

Created byJones Granatyr, AI Expert Academy

Last updated 11/2025

English

What you'll learn

Use ChatGPT as a co-pilot in your Data Science and Machine Learning projects
Implement projects without typing a single line of code, as ChatGPT will handle code generation
Extract insights from datasets with the assistance of ChatGPT
Generate Python code, from data loading to result analysis
Generate Python code for natural language processing projects

Course content

4 sections • 37 lectures • 4h 6m total length

Course content11:43
Starting ChatGPT8:51
ChatGPT for DS and ML7:38
Course materials0:08

Basic statistics 16:16
Learn to start ChatGPT conversations for data science with a census dataset to predict income, list features, handle missing values, and generate Python scripts for preprocessing and classification.
Basic statistics 26:06
Generate Python code in Colab to load census.csv, display basic statistics and info for numerical columns, and show unique value counts for categorical features and the income target.
Basic statistics 39:02
Generate and visualize statistics for numeric columns with a Python script, then use ChatGPT to interpret results, identify insights like class imbalance and correlations, and guide data preprocessing.
Missing values 18:23
Explore identifying and replacing interrogation marks as missing values in a dataset using Python and pandas in Google Colab, focusing on categorical columns like work class, occupation, and native country.
Missing values 27:54
Outliers7:37
Relation between variables8:17
Explore correlation analysis between education and income, age and hours per week, and capital gain and capital loss, and visualize these relationships with Python using scatter, box, and bar plots.
Charts - categorical features10:02
Visualize categorical columns with bar, pie, treemap, and heatmap charts using Python scripts, while handling errors when income is categorical and generating per-attribute charts of unique values.
Charts - numerical features7:40
Generate and compare ten chart types for numerical attributes with ChatGPT, from histograms and box plots to heatmaps and pair plots, then select and refine visuals in Colab.
Dynamic charts6:45
Learn to generate dynamic charts in Python with Plotly, including dynamic correlation matrices and dynamic 3D scatterplots, and iteratively refine code to display values.

Categorical features - LabelEncoder6:12
Unbalanced data7:50
explores class imbalance in the income attribute using oversampling, undersampling, and smote to balance data, visualize distributions, and apply preprocessing for more reliable models.
Categorical features - OneHotEncoding7:50
One hot encoding turns categorical features into binary columns, preserving information and avoiding numerical assumptions. It shows when to apply it to linear, logistic, and kNN models in census study.
Feature scaling6:53
Apply feature scaling to align numeric attributes on a common scale. Compare normalization and standardization, note outlier sensitivity, and demonstrate standardization with a sklearn script on the encoded data.
Train and test data3:59
Learn to split data into training and testing sets, using train_test_split and cross-validation, with 70–80% training and 30% testing, plus one-hot encoded features and smote resampling.
Algorithms and evaluation10:08
Explore classification algorithms in sklearn and evaluate ten models using train and test data, reporting accuracy, precision, recall, F1, and confusion matrices to compare performance.
Cross validation 17:02
Explore cross-validation to evaluate multiple algorithms—train and test across k folds, report mean accuracy and standard deviation for classifiers like decision tree, random forest, AdaBoost, and QDA.
Cross validation 25:05
Interpret cross-validation results across classifiers using accuracy, precision, recall, and F1 score; random forest achieves about 88% accuracy with a stable standard deviation, outperforming the other models.
Hyperparameter tuning4:56
Tune hyperparameters for a random forest to improve results using grid search cross-validation on a trimmed parameter grid including number of estimators, max depth, and min samples split.
Final model3:56
Save and load a trained random forest with the best hyperparameters using joblib, run predictions on sample data, and test results for app development.
Feature selection9:24
Learn feature selection with x_resampled and y_resampled, identify the top five attributes by feature importance, and compare full versus reduced data random forest models with re-applied preprocessing.
HOMEWORK - regression0:22
Solution 19:52
Predict house prices using a regression dataset by analyzing features like bedrooms, bathrooms, sqft, and living area, and learn to read the house prices csv and compute basic statistics.
Solution 27:30
Solution 37:00
Generate and explore charts to visualize correlations between price, sqft living, grades, condition, and renovation years, using scatter plots, histograms, box plots, and a location map.
Solution 46:06
Standardize the dataset (excluding price), split 70/30 into training and testing sets, and evaluate multiple regression algorithms, with random forest and gradient boosting performing best and SVR the worst.
Solution 55:28
Twitter dataset 19:48
Use a Twitter dataset to classify tweets as positive or negative by analyzing text attributes, balancing a 10,000-tweet sample, and generating insights with word clouds and preprocessing in Python.
Twitter dataset 26:37
Continue exploring the Twitter data set by regenerating statistics for the balanced set sample, analyzing sentiment distribution, tweets by day, and word clouds with stop-word removal.
Pre-processing texts3:30
Implement text preprocessing to prepare data for machine learning by lowercasing, removing HTML, URLs, usernames, numbers, punctuation, and stopwords; then tokenize to remove stopwords in the balanced sample.
Sentiment analysis7:29

Requirements

Programming language
Basic Python programming
Machine Learning basics

Description

This course covers an exciting journey in the application of ChatGPT in the field of Data Science and Machine Learning. Throughout this program, you will explore ChatGPT's ability as a valuable tool in data analysis, preprocessing, and machine learning model building without needing to write a single line of code!

In the first part, we will dive into fundamental data analysis techniques. You will learn how to extract crucial statistical information from your datasets, handle missing values, and identify and treat outliers. We will explore the relationships between variables and the visual representation of categorical and numerical data. Additionally, you will have the opportunity to create interactive graphs, making data exploration more engaging and informative. In the second part, we will delve deeper into the field of machine learning, and you will learn how to handle categorical attributes using techniques like LabelEncoder and OneHotEncoding. We will address the challenge of imbalanced datasets and discuss the importance of feature scaling. You will also gain experience in effective data splitting, selecting appropriate algorithms, and evaluation methods. Cross-validation, parameter tuning, and feature selection are essential parts of the modeling process, and you will have the opportunity to enhance your skills in these areas.

Upon completing this course, you will be equipped with advanced skills in data science and machine learning, empowered to effectively apply ChatGPT in real-world projects. This program offers a unique opportunity to enhance your analytical skills and stand out in the field of data science and machine learning. Get ready to reach a new level in your professional career!

Who this course is for:

Professionals who want to explore how ChatGPT can be integrated into their data analysis and machine learning modeling activities
Professionals working on machine learning model development who wish to understand how ChatGPT can be a valuable tool in the process
Students looking to acquire knowledge in data analysis and machine learning, incorporating ChatGPT into their skill set
Individuals with technical programming knowledge who want to expand their skills in data analysis and machine learning with the assistance of ChatGPT
People working with data analysis who want to enhance their techniques and approaches using ChatGPT
Enthusiasts who want to explore the use of ChatGPT as a supportive tool in their machine learning activities

What you'll learn

Explore related topics

Course content

Introduction4 lectures • 28min

Data analysis and exploration10 lectures • 1hr 18min

Machine learning21 lectures • 2hr 17min

Final remarks2 lectures • 3min

Requirements

Description

Who this course is for: