Name: Data Analytics, Data Science, & Machine Learning - All in 1
Rating: 4.5 (4657 reviews)

Udemy Business

Teach on Udemy

Turn what you know into an opportunity and reach millions around the world.

Learn More

Your cart is empty.

Keep shopping

Created byAnalytix AI

Last updated 2/2026

English

What you'll learn

Understand data science foundations, applications, and the path to becoming a data scientist.
Analyze data using Python programming with variables, loops, functions, and OOP.
Apply statistics and probability with distributions, hypothesis testing, and inference in Python.
Perform data cleaning, transformation, and EDA using pandas and NumPy.
Visualize data with Python using bar charts, histograms, scatterplots, heatmaps, and box plots.
Build regression, classification, and clustering models with scikit-learn and evaluate performance.
Master advanced ML techniques like cross-validation, feature engineering, regularization, and hyperparameter tuning.
Implement ensemble learning methods such as Random Forest, AdaBoost, CatBoost, LightGBM, and XGBoost.
Explore deep learning with neural networks and TensorFlow, from preprocessing to model evaluation.
Gain hands-on experience through real-life projects and assessments to build a strong portfolio.
Acquire Excel, SQL, Python, Power BI, and ChatGPT skills to prepare for a data analyst career.
Learn data analysis foundations with statistics, hypothesis testing, and machine learning.
Use Excel for data cleaning, manipulation, formulas, functions, graphs, and charts.
Apply Excel advanced tools like pivot tables, Analysis ToolPak, and interactive dashboards.
Understand RDBMS fundamentals including keys, data types, and relational models.
Work with MySQL for table manipulation, constraints, indices, filtering, and joins.
Learn Python basics including variables, data types, lists, dictionaries, loops, and functions.
Master Python for data cleaning, manipulation, preprocessing, and transformation.
Use Python for visualization, exploratory analysis, statistics, and ML modeling.
Utilize ChatGPT for data manipulation, merging, pivot tables, and conditional logic.
Apply ChatGPT for predictive analytics with Random Forest and ML models.
Learn Power BI for data manipulation, analysis, and dashboard insights.
Create professional, story-driven dashboards in Power BI with impactful visuals.
Complete 30+ assignments, 120+ coding exercises, and 10 quizzes with 100+ questions.
Accomplish 4 capstone projects: bank churn analysis, sports analytics, HR data management and website performance analysis.
Accomplish 7 AI projects: Image Captioning, Chatbot, Voice Assistant, Text to Image, Video Summarizer, Language Translator and Data Analyst AI

Coding Exercises

This course includes our updated coding exercises so you can practice your skills as you learn.

Course content

15 sections • 517 lectures • 66h 16m total length

Only for my Udemy Students!0:20
Data Analysis with Practical Example7:49
Type of Data Analysis in Real-world14:51
Widely Used Tools of Data Analysis13:33
The 8 - Steps in Data Analysis39:23
Interview for the Data Analyst's Position
Test on Fundamentals of Data Analysis

All notes on Python Fundamentals0:03
Installing Python & Jupyter notebook0:08
Note on python data analysis0:08
Datasets used in the course0:07
Understanding Expressions and Variables6:31
Explore expressions and variables in Python, including operands, operators, and how Python evaluates expressions to results, and learn assignment to store and reuse values with examples like five plus three.
Hands-on Lab: Expressions and Variables7:40
Practice expressions and variables in Python to calculate total eggs bought, total eggs used, and remaining eggs across three days using addition, subtraction, and print statements.
Creating variables
Understanding Data Types6:49
Explore the definition, purpose, and features of Python data types: integers, floats, strings, and booleans. Learn typecasting with int, float, str, and bool to convert between data types.
Hands-on Lab: Python Data Types5:17
Practice Python data type conversion through a hands-on lab that converts string to integer, float to string, and boolean to integer, while verifying types with the type function.
Converting data types #1
Converting data types #2
Converting data types #3
Various String Operators11:59
Master Python string handling by exploring quotes, word strings, indexing from zero and negative indexing, slicing, len, concatenation, escape sequences, and common methods like upper, lower, and replace.
Hands-on Lab: Various String Operators12:32
Explore hands-on practice with various string operators in Python, including indexing, slicing, negative indexing, substrings, step-based slicing, length, concatenation, escape sequences, case conversion, and replacement.
Starting with Variables to Data Types0:03
Understanding Tuples and Lists17:24
Explore Python data structures by learning tuples, their immutability and order, along with list operations, slicing, concatenation, and nested structures such as lists of lists.
Hands-on: Tuples and Lists10:46
Explore hands-on Python techniques for tuples and lists, including nested tuple and nested list slicing, negative indexing, and stepwise extraction, along with append and remove operations.
Creating list
Indexing list
Slicing list
Adding element
Removing element
Replacing element
Operations & Manipulation of Sets10:14
Explore how Python sets ensure unique elements and remove duplicates, are unordered and mutable, and how to convert lists to sets, add and update items, and perform union and intersection.
Hands-on Lab: Sets6:05
Explore working with sets in Python via a hands-on lab in Jupyter Notebook, converting lists to sets, adding and removing elements, and performing union and intersection.
Union sets
Reducing sets
Working with Dictionaries9:05
Learn how Python dictionaries store data as unique and immutable key–value pairs, access values by keys, and add or remove entries with keys, values, and items.
Hands-on Lab: Dictionaries6:43
Explore working with dictionaries in Python via a hands-on lab in a Jupyter notebook. Practice slicing and extracting values, adding keys, deleting entries, and listing keys, values, and items.
Create dictionary
Adding keys and values
Several data structures0:03
Condition and Branching14:43
Explore how conditions and branching control Python programs with if, else, and elif. Master boolean logic, comparison operators, and the or and not operators, with examples such as voting eligibility.
Hands-on Lab: Condition & Branching14:28
Apply Python conditionals and branching to solve real problems by classifying animals by speed, evaluating discounted ticket eligibility, and determining eco friendly car race eligibility with nested if statements.
Conditional statement #1
Conditional statement #2
Logical expression #1
Logical expression #2
Logical expression #3
Loops for Iteration12:09
Explore loops for iteration in Python, including for loops, while loops, range, and enumerate, to repeat tasks over lists, strings, and tuples.
Hands-on Lab: Loops13:47
Explore hands-on loop concepts with for and while loops, enumerating tasks, printing welcome messages, generating even numbers, and classifying numbers as positive, negative, or zero.
For loop
While loop
Developing Functions18:42
Develop and use functions with def, arguments, and return, and distinguish between return and print. Explore global and local variables, and examples like temperature conversion and circle area.
Hands-on Lab: Python Functions13:15
Develop Python functions to calculate compound interest, EMI, and BMI, applying the defined principles and formulas to real-world savings, loan payments, and health assessments.
Object and Classes10:46
Learn how Python uses a class as a blueprint to create objects with attributes and methods, using a car example to show brand and wheels, and start_engine.
Hands-on Lab: Object and Classes14:24
Develop an object-oriented library system in Python by building a book class with title, author, and availability, and adding borrow, return, and check availability methods.
Dealing with function #1
Dealing with function #2
Conditionals Looping and Functions0:03
API, REST API & Request10:33
Explore how application programming interfaces enable communication between apps and servers, use Rest principles, and perform create, read, update, delete operations via http methods, endpoints, and api keys.
HTML and BeautifulSoup12:49
Learn HTML fundamentals and how to apply BeautifulSoup for web scraping, including parsing and navigating HTML, using requests to fetch pages, and extracting links with find_all and href.
Hands-on Lab: BeautifulSoup22:24
Learn web scraping with BeautifulSoup and pandas to collect data, parse HTML, extract headings, paragraphs, links, and tables, convert to a dataframe, and export to Excel.
Web scrapping in Python
Open() to import data8:59
Explore how the Python open function reads and writes text files, using r, w, and a modes, read with read, readline, and for loops, then close the file.
Hands-on Lab: Open()9:39
Practice using the Python open function to read and write text files. Learn to read full content and the first line, create samsung.txt, and append to apple incorporation.txt.
Reading and Writing with Pandas11:59
Learn to read and write data with the pandas library in Python, using read_csv and read_excel, manipulate dataframes, and preview datasets with head and tail.
Hands-on Lab: Importing datasets6:26
Use pandas to read the Excel file, load February sheet into a dataframe, print the first ten rows, convert a dict to a dataframe, and export as CSV without indices.
Reading and Writing JASON & XML6:52
Learn to read and write JSON and XML data using Python, including json.load and json.loads for files and strings, and xml.etree.ElementTree for parsing and accessing tags.
Hands-on Lab: Importing JASON & XML9:28
Learn to read JSON and XML files in Python by loading JSON data and printing student names and grades, then parsing XML to print employee names and positions.
Exception Handling9:01
Apply Python's try, except, else to handle runtime errors such as syntax, value, zero division, and file not found, improving robustness for user input, file I/O, and API use.
Hands-on Lab: Exception Handling7:13
Practice exception handling in python with two problems: handle data dot txt not found via try and accept, then manage zero division and value errors on user input.
Reading material: Errors in Python1:36

Required Dataset0:07
Preparing Notebook & Loading Data13:14
Loading a CSV Data
Deep Dive into Missing Values11:26
Hands-on Python for Defining Missing Value5:02
Identify missing value
Hands-on Python for Imputing Missing Value13:23
Imputing missing values
Explore Data Types in Python17:19
Hands-on Python for Defining Data Types5:37
Hands-on Python for Dealing with Data Types4:33
Checking data type
Assigning data type
All About Data Inconsistency8:53
Hands-on Python for Correcting Inconsistency8:33
Finding the unique values
Removing inconsistent value
Deep Dive into Duplicates6:58
Hands-on Python for Correcting Duplicates5:20
Identify duplicates
Removing duplicates
Test on Data Cleaning
Data Sorting in Python7:02
Hands-on Python for Sorting Data5:51
dataset sorting
Data Filtering in Python7:33
Hands-on Python for Boolean Filtering8:47
Boolean filtering
Hands-on Python for Query Filtering5:59
Query Filtering
Hands-on Python for IsIn Method Filtering5:37
IsIn Filtering
Hands-on Python for Loc-iLoc Filtering10:21
Slicing with loc
Slicing with iloc
Hands-on Python for Multiple Filtering7:19
Multiple conditions
Data Joining in Python7:48
Hands-on Python for Horizontal Joining7:26
Merging dataframes
Hands-on Python for Vertical Joining8:33
Concatenating dataframes
Python Codes in the Practice0:02
Test on Data Manipulation
Frequency & Percentage Analysis6:23
Hands-on Python for Frequency & Percentage6:56
Value counts method
Group by Analysis6:20
Hands-on Python for Group by Analysis5:41
Group by method
Pivot Table Analysis7:56
Hands-on Python for Pivot Table Analysis13:42
PIVOT table analysis
Cross-Tabulation Analysis6:40
Hands-on Python for Cross-Tab Analysis4:42
Cross-tab analysis
Test on Exploratory Data Analysis
Data Visualization Methods Part 110:14
Understand Matplotlib Pyplot6:46
Hands-on Python for Bar Chart12:18
Bar Chart
Hands-on Python for Pie Chart6:23
Pie chart
Hands-on Python for Line Chart7:53
Line chart
Hands-on Python for Histogram10:30
Histogram
Data Visualization Methods Part 29:07
Hands-on Python for Stacked Barchart9:40
Stacked Bar Chart
Hands-on Python for Scatterplot9:03
Scatterplot
Hands-on Python for Heatmap8:01
Heatmap
Hands-on Python for Boxplot8:40
Boxplot
Hands-on Python for KDE plot21:31
Kdeplot for distribution
Python Codes in the Practice0:02
Download the python (.py) file. Check the given codes. Practice yourself.
Test on Data Visualization

All notes on Data Science0:03
Important Messages for You!0:23
What is Data Science14:11
Explore how data science blends statistics, computer science, and machine learning to derive insights, using data cleaning, analysis, visualization, and predictive analytics across industries.
Fundamentals of Data Science12:18
Explore data science fundamentals: linear algebra, probability, statistics, and calculus. Learn Python or R and key libraries—numpy, pandas, matplotlib, seaborn, scikit-learn—for data manipulation, visualization, and supervised or unsupervised learning.
The path to be a Data Scientist14:25
Explore the path to becoming a data scientist by mastering linear algebra, probability, statistics, data manipulation, visualization, and machine learning with Python or R.
Knowledge Assessment 1
Data Analysis15:06
Explore the data analysis process: cleaning, transforming, and uncovering patterns to inform decisions. Master descriptive, diagnostic, predictive, and prescriptive analysis using Excel, SPSS, R, Python, Tableau, Power BI, and SQL.
Business Intelligence14:41
Explore business intelligence as tools and practices that gather, integrate, analyze, and present data via dashboards and KPIs to guide data-driven decisions.
Statistical Modeling19:21
Explore how statistical modeling uses mathematical relationships to analyze and predict outcomes from data, employing techniques like linear regression and time series to support data-driven business decisions.
Knowledge Assessment 2
Machine Learning16:06
Explore machine learning as a data-driven subset of artificial intelligence, covering supervised, unsupervised, and reinforcement learning, with data collection, preprocessing, training, evaluation, and deployment for scalable predictions and personalization.
Deep Learning17:31
Dive into deep learning, a multi-layer neural network approach that automatically extracts features from raw data to handle complex tasks like image recognition, speech recognition, and autonomous driving.
Artificial Intelligence10:28
Explore artificial intelligence, its differences from machine learning and deep learning, and how AI powers supply chain optimization, personalized recommendations, chatbots, voice assistants, and autonomous vehicles.
Knowledge Assessment 3
Traditional Data vs Big Data12:50
Compare traditional data and big data, highlighting structured data in relational databases and five V's—volume, velocity, variety, veracity, value—and introduce sources and tools like Hadoop, Spark, NoSQL, and data lakes.
Working with Big Data5:38
Explore big data concepts, including distributed storage with Hadoop, data lakes, and scalable cloud storage, and learn batch, real-time, and stream processing, data mining, predictive analytics, ETL, and data governance.
Real - life examples of Big data3:25
Knowledge Assessment 4
Database management tools10:33
Explore database management tools such as Apache Hadoop, Microsoft SQL Server, Oracle, and MongoDB for handling big data, unstructured data, and distributed processing with scalable, fault-tolerant analytics.
Programming languages7:41
Explore Python’s simple syntax, its NumPy and pandas data analysis libraries, and its scikit-learn, TensorFlow, and PyTorch machine learning capabilities, alongside R’s statistics and ggplot2 visualizations for data science.
360 Data analytics tools9:09
Explore 360 data analytics tools, from Excel to IBM SPSS, for data analysis and visualization, with insights on when to use each and their limitations.
Data visualization tools7:06
Compare Power BI and Tableau as data visualization tools, highlighting Microsoft integration, live data dashboards, real-time analytics, cost considerations, and suitability for small to large enterprises.
Development environments8:07
Explore Jupyter notebook as an open source, interactive platform for live code and visualizations, and note its language flexibility (Python, R, Julia) and cloud options (Azure, Google Colab) for collaboration.
Knowledge Assessment 5
Step 1 - Business understanding8:28
Identify the business problem as the first step in data science to guide data collection, methods, and success metrics. Apply the five whys to uncover root causes.
Step 2 - Data collection7:59
Collect relevant data through online survey tools (Google Forms, SurveyMonkey) and offline methods to inform decision making, ensuring data accuracy, reliability, and validity with appropriate sampling, time, and cost.
Step 3 - Data preparation5:48
Prepare raw data by cleaning, manipulating, and transforming it for analysis, ensuring accuracy, consistency, and readiness for machine learning.
Step 4 - Data modeling8:07
Build a structured data model that maps relationships and constraints for reliable analysis. Emphasize data quality, feature relevance, and the choice between statistical and machine learning approaches.
Step 5 - Model evaluation15:04
Evaluate your model on unseen data with regression and classification metrics: MAE, MSE, RMSE, R-squared, precision, recall, F1, AUC, ROC; identify overfitting or underfitting, then tune hyperparameters for deployment.
Step 6 - Model deployment6:27
Deploy your trained model to a real-world environment, delivering real-time predictions via cloud, edge, or container-based deployments while ensuring security, latency, and ongoing maintenance.
Knowledge Assessment 6

All notes on Data Analytics0:03
Extra note on analytical world of data0:09
Various sources of collecting data5:27
Explore primary, secondary, and tertiary data sources, weighing advantages and challenges to guide reliable data collection for research and analytics.
Population v/s sample and its methods11:05
Clarify the population versus the sample and explore sampling methods like simple random, stratified, systematic, cluster, convenience, and snowball sampling, with their advantages and challenges.
The application of statistical test8:31
Discover the basics of statistical data analysis, from cleaning and transforming data to modeling, using tools like R, Python, and SPSS for descriptive statistics, regression, and multivariate analysis.
Types of statistical data analysis4:58
Explore descriptive and inferential statistics, summarize data with metrics and charts, and infer population trends from samples using tests like the t test.
T-tests and ANOVA6:54
Explore inferential statistics methods such as one sample t test, independent and paired t tests, and one way ANOVA to assess mean differences in populations and groups.
Relationships measures3:16
Explore chi-square tests for independence to assess associations between categorical variables via observed and expected frequencies, and apply Pearson correlation to quantify linear relationships between numeric or ordinal variables.
Regression analysis11:53
Learn how linear regression links a dependent variable to one or more independent variables, using simple and multiple models, the regression equation, R square, and beta values to predict outcomes.
Statistical data analysis
Probability in data analysis8:10
Explore how probability measures likelihood in data analysis, using the die example and the favorable-outcome over total-outcome formula to support decision making and risk assessment in statistics.
Classical probability4:58
Explore classical probability by counting equally likely outcomes, such as coin flips and marble draws. Apply this approach to real-world decisions, like forecasting demand for birthday versus wedding decorations.
Empirical probability5:21
Explore empirical probability, based on actual experiments, by dividing observed events by total trials, with real-world examples like vintage t-shirt sales at 50% and red marbles at 40%.
Conditional probability7:07
Explore conditional probability and its real-world use in data analysis, including calculating P(A|B) with business cases and card-draw examples.
Joint probability4:47
Explore joint probability by showing how two events occur together, illustrated with a sunscreen and sunglasses bundle and red queens in a deck.
Hypothesis testing for inferential statistics6:03
Explore how hypothesis testing guides decision making in inferential statistics by evaluating sample data against null and alternative hypotheses. Apply test statistics and p-values to determine conclusions and quantify evidence.
Selecting statistical test and assumption testing10:26
Learn to select the appropriate statistical test for a scenario and hypothesis, from t tests to ANOVA, chi square, correlation, and regression, and check normality, linearity, and homoscedasticity.
Confidence level, significance level, p-value3:37
Explore confidence level, significance level, and p value in hypothesis testing, and learn how these measures guide decisions and conclusions.
Making decision and conclusion on findings2:55
Make informed decisions and conclusions by comparing a calculated p value to a 5% significance level, deciding between null and alternative hypotheses, and stating the conclusion.
Complete statistical analysis and hypothesis testing7:01
Compare two independent classes to test a new teaching method with a step-by-step hypothesis testing workflow, including H0, H1, alpha 0.05, Shapiro-Wilk, and t test.
Hypothesis Testing in Statistical Analysis
Transforming data for improved analysis5:14
Transform data to improve quality and model readiness by normalizing scales, mitigating skew with log or Box-Cox transforms, handling outliers, creating features, reducing dimensionality, and encoding categorical variables.
Techniques for data transformation Part 16:11
Master data transformation techniques to reduce skewness and improve model performance, including logarithmic and Box-Cox transformations, binding, and one-hot encoding.
Techniques for data transformation Part 24:07
Explore creating new features from revenue and cost to form profit, extract day, month, and year from dates, and apply standardization, normalization, and PCA to prepare data for analysis.
Several methods of data visualization Part 14:04
Explore data visualization methods, including bar charts for category comparisons, stacked bar charts for totals and subcategories, and line graphs to reveal trends over time.
Several methods of data visualization Part 25:38
Master data visualization by comparing pie charts for proportions, histograms for numeric distributions, scatter plots for relationships, and heatmaps that use color to reveal correlation patterns and the correlation coefficient.
Several methods of data visualization Part 37:17
Explore area charts to show change and differentiate trends with colored areas; compare to line charts, study bubble plots with size signaling frequencies, and box plots show quartiles and outliers.
Understanding Data Transformation

All notes on ML, DL & AI0:03
ML for data analysis and decision-making6:07
Explore how machine learning from data analysis informs decision making across industries. Learn about predictive analytics, real time analysis, and customer personalization for business impact.
Widely used ML methods in the data analytics11:11
Explore supervised and unsupervised machine learning in data analytics, including classification, regression, logistic regression, decision trees, and random forests, with practical examples.
Steps in developing machine learning model7:43
Define the core problem with inputs and outputs, then collect, clean, and engineer features. Choose a model, train and evaluate with accuracy, MSE, and MAPE, deploy, and monitor.
Machine learning basics
What is Machine learning?8:22
Explore how machine learning learns from training data to predict and decide, using supervised, unsupervised, and reinforcement learning, with real-world applications like spam filtering, recommendations, and fraud detection.
Supervised Regression models9:52
Explore supervised regression models, including linear regression, svr, random forest regressor, ridge regression, and polynomial regression, highlighting robustness, least squares minimization, and when to apply each.
Supervised Classification models9:47
Explore supervised classification models such as logistic regression, SVM, random forest, Naive Bayes, and KNN, and learn how they assign class labels from probabilities using thresholds like 0.5.
Unsupervised clustering models9:10
Explore unsupervised clustering models such as k-means and DBSCAN to discover natural data groupings without predefined labels, evaluate with the elbow method, and handle outliers.
Model evaluating metrics17:44
Assess model performance using classification and regression metrics, including accuracy, precision, RMSE, ROC AUC, and confusion matrices, to compare models on unseen data.
Overfitting & Underfitting7:19
Identify how overfitting and underfitting hinder generalization, and apply fixes like regularization, early stopping, more data, and adjusting model complexity to balance bias and variance.
Imbalanced data problem8:15
Explore imbalanced data, where the majority class dominates, and learn F1 and ROC AUC metrics plus SMOTE and weighted loss functions.
Knowledge Assessment 23
What is Matrix?15:41
Explore matrices as grids of numbers with rows and columns, cover operations like addition, subtraction, scalar and matrix multiplication, and review forms such as square, diagonal, zero, and identity matrices.
Scalars and Vectors10:35
Explore scalars and vectors, where scalars have magnitude only and vectors have magnitude with direction, represented as arrows, with vector addition and scalar multiplication.
Linear algebra introduction11:13
Explore linear algebra foundations with vectors, matrices, and linear transformations, and learn how these tools solve systems and power data science and machine learning through eigenvalues and eigenvectors.
What is Tensor?6:37
Explore tensors in linear algebra, from scalars, vectors, and matrices to multi-dimensional data like color images, and learn how order, shape, and axes enable deep learning and complex simulations.
Transpose of Matrix7:49
Explore the transpose of a matrix by flipping rows into columns, note dimension changes, and apply (A+B)^T = A^T + B^T and (AB)^T = B^T A^T.
Dot product and Matrix6:31
Explore the dot product as matrix multiplication, multiplying rows by columns and summing results. Learn dimensional rules, key properties, and practical takeaways for matrix operations.
Knowledge Assessment 21
How Linear regression works8:51
Explore linear regression, a supervised model, fitting a best line by minimizing the residual sum of squares to predict the target Y from input features X, using slope and intercept.
How Logistic regression works11:57
Explore how logistic regression performs classification by mapping features to probabilities with the sigmoid function, applying a 0.5 threshold to assign classes, and training with cross-entropy loss via gradient-based optimization.
K-fold cross validation12:09
Master k-fold cross validation, a resampling method that trains on k-1 folds and validates on the remaining fold to assess model performance on unseen data.
L1, L2 regularization13:42
Explore L1 and L2 regularization to prevent overfitting, compare feature selection and coefficient shrinkage, and learn how lambda controls penalties in regression and neural models.
The oversampling method10:34
Explore oversampling methods to address class imbalance in supervised learning by increasing minority class examples. Study random oversampling, SMOTE, and ADASYN, and prevent overfitting.
The undersampling method9:06
Balance imbalanced datasets by undersampling the majority class to match the minority, boosting model fairness and recall with methods like random undersampling, Tomek links, and near-miss.
How KMeans clustering works17:13
K-means clustering uses centroids and Euclidean distance to minimize intra-cluster variance and maximize inter-cluster separation, with elbow method guidance for choosing k.
How Decision tree regression works12:20
Explore how decision tree regression predicts continuous outcomes by greedily splitting feature space into rectangular regions and assigning a mean value per leaf, using MSE as the impurity measure.
How Decision tree classification works11:15
Explore how decision tree classification builds a feature-based decision tree to assign data to classes. Learn how splits use gini impurity or information gain and how depth controls overfitting.
How Random forest regression works11:19
Explore how random forest regression uses an ensemble of decision trees, bootstrap sampling, and random feature selection to predict continuous targets with improved accuracy and reduced overfitting.
How Random forest classification works7:14
Explore how random forest classification builds an ensemble of decision trees trained on bootstrap samples with random feature selection, then uses majority voting to boost accuracy and generalization.
How AdaBoost Models work8:42
Explore how AdaBoost builds a strong classifier by sequentially training weak learners, weighting misclassified samples, and using a weighted vote to boost accuracy, highlighting its adaptiveness and sensitivity to noise.
How Traditional GBM works13:04
Explore how traditional gradient boosting builds an additive ensemble of shallow decision trees, using gradient descent to minimize loss and correct residuals with each new tree.
How CatBoost Models work9:42
Explore how CatBoost uses ordered target statistics to encode categorical features and build symmetric trees with gradient boosting for fast, accurate, and robust overfitting control.
How LightGBM Models work10:47
Explore how Lightgbm, a fast, memory-efficient gradient boosting framework using leaf wise tree growth, optimizes classification, regression, and ranking on large datasets with gradient-based sampling, histograms, and cross-entropy loss.
How XGBoost Models work11:37
Explore how XGBoost, a fast, scalable gradient boosting library, optimizes classification, regression, and ranking through parallelized trees, regularization, sparsity handling, and custom loss functions.
What is Hyperparameter tuning?10:25
Explore hyperparameter tuning, including grid search, random search, and Bayesian optimization, to select learning rate, batch size, regularization, and other settings that prevent underfitting and overfitting while enabling cross-validation.
Understanding Deep Learning10:05
Deep learning uses multilayer neural networks to automatically learn hierarchical features from large data, with architectures like CNNs, RNNs, Transformers, and autoencoders, trained via forward propagation, backpropagation, and gradient descent.
Neural Networks in Deep Learning11:06
Discover how neural networks transform inputs into outputs through weighted sums with biases and non-linear activations, learned by forward and backward propagation, with CNNs, RNNs, and transformers.
What is TensorFlow?10:53
Explore TensorFlow's open source framework that uses data flow graphs and tensors to build, train, and deploy scalable machine learning models across devices, with a rich ecosystem and Keras integration.
How TensorFlow 2.0 works8:35
Explore how TensorFlow 2.0 works, emphasizing eager execution by default, the unified tf.keras API, and streamlined deployment with TensorFlow Lite, TensorFlow Serving, and TFX.
Knowledge Assessment 24
What is Initialization?6:29
Explore initialization in deep learning, setting starting weights and biases to guide training, prevent vanishing or exploding gradients, break symmetry, and speed convergence with Xavier/Glorot methods in TensorFlow models.
Glorot Initialization7:56
Glorot initialization, also called Xavier initialization, sets initial weights to keep activations and gradients balanced across layers, preventing vanishing or exploding gradients and favoring tanh or sigmoid activations.
Stochastic Gradient Descent6:18
Explore how stochastic gradient descent trains deep learning models by updating weights after each example using the loss gradient, offering fast, memory-efficient optimization that can help escape local minima.
Knowledge Assessment 25
AI history, definition and workflow6:32
Explore the history, definition, and workflow of artificial intelligence, from early computing to deep learning, neural networks, and current AI applications like natural language processing and computer vision.
Various types of Artificial intelligence5:52
Explore the three AI types by strength—weak (narrow), strong (generalized), and super (conscious)—and see how ethics, psychology, and computer science shape AI across fields.
Artificial v/s Augmented Intelligence4:36
Explore how human intelligence, artificial intelligence, and augmented intelligence collaborate to enhance decision making and a safer commute, highlighting the synergy between machine data processing and human insight.
Generative AI and Its use cases4:30
Explore generative AI and its diverse use cases across sectors, powered by deep learning and large language models to generate text, images, music, and video.
Traditional AI v/s Generative AI5:04
Traditional AI relies on an organization's repository, analytics platform, and application layer with a feedback loop to refine predictions; generative AI uses vast external data and prompting to tailor models.
Reading material: Types of AI2:47
AI use cases in Daily life6:16
AI use cases in life appear in voice assistants, smart home devices, and personalized recommendations. The lecture also covers security, health monitoring with wearables, camera enhancements, and real time navigation.
What is AI Chatbot?7:14
Explore ai chatbots and smart assistants that use natural language processing, dialogue management, deep learning, and machine learning to interpret input, detect intent, and deliver personalized responses.
Gen AI Tools and Applications6:17
Explore generative AI tools and applications across diverse fields. Learn how industry leaders integrate this technology and leverage multimodal llms that handle text, images, audio, and video.
Reading material: AI and Generative AI2:15
Various models of Generative AI8:02
Explore generative ai models and their types, including variational autoencoders, generative adversarial networks, autoregressive models, and transformers, and see how they generate text, art, music, and videos.
NLP, Speech Technology & Computer vision9:26
Explore natural language processing, speech technology, and computer vision to understand applications in industries and how NLP analyzes language, CT converts speech to text, and TTS generates speech for interaction.
AI, Cloud and Edge computing & IoT7:33
Learn how AI, cloud, edge computing, and IoT create intelligent, real-time applications by processing data from devices like fitness trackers and smart thermostats.
Reading material: The parts of AI + Gen AI3:15
Tools for Text Generation7:49
Discover tools for text generation powered by generative AI. Learn how large language models like GPT and Palm enable coherent, context-aware text and multimodal capabilities across chat and research tasks.
Generating Text with ChatGPT Using OpenAI's Website
Tools for Image Generation8:21
Discover the core capabilities of generative AI for image creation, including image to image translation, inpainting, outpainting, and style transfer, with tools like Dall-E, Stable Diffusion, and Midjourney.
Creating an Image Using Freepik
Tools for Code Generation8:02
Explore how generative AI powers code generation, completion, optimization, language translation, and documentation through tools like GPT, ChatGPT, Copilot, Polly Coder, IBM Watson, and more.
Generating Python Code with Gemini
Tools for Audio and Video Generation7:28
Explore generative AI tools for audio and video, including speech generation, music creation, and audio enhancement. Use text-to-speech, video tools, and avatar creation to boost accessibility and visuals.
Reading material: Gen AI Tools2:35
What is a Prompt?7:35
Define a prompt and its core components, and show how enriched prompts with context, input data, and output indicators guide generative AI models to precise, desirable outputs.
What is Prompt Engineering?6:41
Explore prompt engineering as the art of crafting precise prompts and system prompts to guide generative AI, defining goals, context, and expectations through an iterative process that yields accurate outputs.
Crafting Effective Prompts for AI Models
Best practices in Prompt engineering6:05
learn best practices for crafting prompts using four dimensions—clarity, context, precision, and role play—to unlock generative ai potential and control style, tone, and relevance.
Reading material: prompt engineering tools6:33
Interview pattern prompt technique3:53
Explain the interview pattern approach to prompt engineering and apply it to craft prompts that guide generative AI with detailed, tailored responses through structured follow-up questions.
Applying the Interview Pattern Approach to Prompt Engineering with Gemini
Chain-of-Thought prompt technique3:55
Learn the chain-of-thought prompt technique to guide AI reasoning through step-by-step prompts, using related questions and solved examples to improve accuracy.
Applying the Chain-of-Thought Approach with Gemini
Tree-of-Thought prompt technique3:39
Master the tree of thought prompting to structure prompts hierarchically for multiple reasoning paths, guiding prompt engineering toward tailored, contextually accurate ai outputs.
Applying the Tree-of-Thought Approach with Gemini
Reading material: Prompt engineering2:23

All notes on PD0:03
What is probability?7:27
Explore probability as a mathematical concept that measures how likely events are. Express probability as fraction, decimal, or percentage and apply it to simple, compound, mutually exclusive, and independent events.
Expected value v/s Actual value8:14
Explore the expected value and the gap to actual outcomes in probability, using a fair die to illustrate long run averages, weighted outcomes, trials, experiments, and experimental and theoretical probabilities.
Frequency in probability5:46
Explore probability frequency distributions that organize outcomes and their probabilities in a table. Use dice roll experiments and test score ranges to illustrate observed frequencies and calculated probabilities.
Complements in probability4:48
Explore complements in probability, where the probability of an event not happening equals one minus the event’s probability, and learn to apply this to at least one or none scenarios.
Knowledge Assessment 7
Intro to combinatorics7:46
Learn how combinatorics counts, arranges, and selects objects using permutations, combinations, and counting rules, and see how it contrasts with probability to solve problems in scheduling, cryptography, and game theory.
Permutations6:10
Learn how permutations count ordered arrangements where order matters, with examples like assigning three managers from ten, license plates, and DNA sequence patterns, using nPr = n!/(n-r)!.
Factorials operations5:32
Explore factorials, defined as the product of all integers from 1 to n, with examples like 5! = 120 and 8! = 40,320, highlighting combinatorics applications.
Combinations4:09
Explore combinations, where order does not matter, contrast with permutations, and apply the combination formula C(n, r) using factorials to form groups, teams, or committees.
Knowledge Assessment 8
Mutually exclusive sets5:33
Identify mutually exclusive sets where a and b share no elements, a ∩ b = ∅, and apply P(A or B) = P(A) + P(B) for exclusive events.
Set dependencies6:35
Explore set dependencies by contrasting independent and dependent events, applying probability rules and conditional probability with coin, dice, and card examples.
Conditional probability5:11
Explore conditional probability, the probability of an event given another, using the formula P(A|B)=P(A∩B)/P(B), and see real-world applications in medicine, marketing, engineering, and AI systems.
Knowledge Assessment 10
The additive rule7:05
Explore the additive rule and how to use cross-tabulations to compute marginal, joint, and conditional probabilities, including cases with mutually exclusive and overlapping events.
The multiplication law5:11
Explore the multiplication rule in probability, computing joint probability for sequential events, distinguishing independent and dependent draws, with card and ball-draw examples, and noting its link to conditional probability.
The bayes' law14:29
Apply Bayesian rule to update beliefs with new evidence, using prior probability and test accuracy; understand Bayes' theorem and law of total probability through medical diagnosis and production examples.
Knowledge Assessment 11
Population and Sample7:47
Define the population and the sample, explain why sampling matters, and outline random, representative, adequately sized methods like simple random, stratified, systematic, and cluster sampling.
Types of Statistical data4:42
Explore the types of statistical data, including qualitative and quantitative forms—nominal, ordinal, discrete, and continuous—and their use in descriptive, inferential, predictive, and trend analyses with Python.
Level of Measurement3:30
Explore levels of measurement, or scales, defining how data can be categorized, ordered, or measured, including nominal, ordinal, interval, and ratio, guiding appropriate analyses.
Knowledge Assessment 15
Intro to Distributions7:26
Explore the basics of distributions, how data spread and frequency reveal patterns, outliers, and variability, and why histograms and probability rest on distribution assumptions for analytics, business, and decision making.
Discrete distributions4:51
Explore discrete distributions, where outcomes are countable, with real-world examples like hourly call volume and library borrowings, and model with binomial and Poisson distributions using Python.
Continuous distributions10:19
Explore continuous distributions that model probabilities for variables with infinite precision within a range, and learn how mean, standard deviation, and z-scores reveal the normal distribution's bell-shaped curve.
Uniform distribution5:26
Explore the uniform distribution, where every value in a range is equally likely, shown by a flat histogram and a simple probability calculation, e.g., 30–50 within 20–70.
Bernoulli distribution5:10
Explore the Bernoulli distribution, a binary outcome model with a single parameter p and its pmf for success or failure, foundational for binomial, geometric, and logistic distributions.
Binomial distribution6:15
Explore the binomial distribution, modeling two-outcome trials with a fixed number of independent Bernoulli trials. Compute the probability of k successes in n trials using the PMF with probability p.
Poisson distribution5:14
Explore the Poisson distribution as the probability of exactly k events in a fixed interval with independent events, constant rate λ, and its mean, variance, and right skew.
Normal distribution6:29
Explore the normal distribution, a bell-shaped, symmetric model around the mean, defined by mu and sigma, and apply z-scores and the 68-95-99.7 rule to compute probabilities.
Students' T distribution13:15
The student’s t distribution handles small samples with unknown sigma, featuring heavier tails for inference. It underpins one-sample and two-sample t tests and confidence intervals, governed by degrees of freedom.
Chi-squared distribution8:52
Explore the chi-square distribution, a non-negative, continuous distribution built on squared differences, and apply it to goodness-of-fit, independence, and variance tests in normally distributed populations with observed and expected frequencies.
Exponential distribution9:59
Explore the exponential distribution, a memoryless continuous model of waiting times governed by lambda, with PDF and CDF for scheduling, queueing, and reliability.
Knowledge Assessment 12, 13, and 14

All notes on EDA0:03
Mean, Median & Mode7:00
Skewness and Kurtosis8:13
Explore skewness and kurtosis to understand distribution shape and outliers. Identify positive and negative skew, mesokurtic, leptokurtic, and platykurtic patterns, and understand excess kurtosis with quick calculations in Python.
Variance and Covariance6:55
Explore variance and covariance, learn their formulas, and interpret how single-variable spread relates to the mean and how two variables move together or apart, with correlation for strength.
Standard deviation9:30
Explore standard deviation as a key descriptive measure of data spread around the mean, including population and sample formulas, variance, and interpretation via the normal distribution and empirical rule.
Knowledge Assessment 16
What is inferential statistics6:38
Explore inferential statistics that generalize from sample to population, test hypotheses, estimate unknown values, and forecast future trends using sampling distributions, standard errors, and confidence intervals.
Central limit theorem7:03
explore the central limit theorem by showing how sample means from any population form a normal distribution as sample size grows, enabling confidence intervals and hypothesis testing.
Standard error7:35
Explore the standard error: a measure of uncertainty in sample means that informs confidence intervals and hypothesis tests, with se = sigma / sqrt(n) and larger samples reducing error.
Estimators and estimates5:52
Identify estimators as formulas applied to sample data to guess population parameters, and estimates as the resulting values, noting they should be unbiased, efficient, and consistent, with confidence intervals.
Knowledge Assessment 17
Confidence interval9:19
Explore how a confidence interval uses a point estimate, standard error, and z-score with a margin of error to bound the true population parameter from sample data.
Z-score v/s T-score9:35
Compare z scores and t scores to standardize data, using known population deviation for large samples and unknown deviation with small samples, enabling hypothesis testing and confidence interval construction.
Margin of error9:11
Explore how margin of error bounds a sample statistic and how z-score or t-score, standard error, sample size, and variability shape the interpretation.
Null v/s Alternative Hypothesis10:10
Explore how to test population claims using null and alternative hypotheses, one-tailed and two-tailed tests, and p value to decide whether to reject or retain H0.
Type | and Type || Error8:38
Learn about type one and type two errors in hypothesis testing, including alpha and beta, false positives and false negatives, and how sample size and power analysis reduce errors.
Knowledge Assessment 18
Step 1: Formulate the Hypotheses6:28
Translate real questions into precise, testable hypotheses by defining the parameter, writing the null and alternative hypotheses, and selecting one-tailed or two-tailed tests for means, proportions, or differences.
Step 2: Select Significance level10:24
Learn to identify the testing goal and data structure to select the right hypothesis test—t tests, chi-square, ANOVA, correlation, and regression—considering one- or two-tailed options and sample size.
Step 3: Perform assumption test16:04
Learn to perform assumption tests for parametric analysis, including normality, homogeneity of variance, independence, and linearity. When assumptions fail, choose nonparametric alternatives and visualize data with histograms and scatter plots.
Step 4: Perform appropriate test8:50
Learn how to select the significance level, or alpha, in hypothesis testing, balancing type I error, sample size, and stakes, with 0.1, 0.05, and 0.01 as examples.
Step 5: Decision and Conclusion5:52
Make a decision and conclude after a hypothesis test by defining the null and alternative hypotheses, choosing alpha, and using the p value to reject or fail to reject.
Knowledge Assessment 19
Kdeplot for distribution6:29
Learn how kernel density estimate (KDE) plots visualize distribution shapes, compare to histograms, and reveal normality, skewness, or multimodal patterns in data.
Shapiro Wilk test6:34
Apply the Shapiro Wilk test to assess normality of data, interpret the W statistic and p value, and decide if parametric methods are appropriate.
Data transformations methods8:01
Explore data transformation methods to stabilize variance and reduce skewness using square root, log, and Box-Cox, enabling positive data to meet parametric test assumptions.
Independent sample t-test5:03
Explore the independent sample t test to compare means of two independent groups, check assumptions of normality and equal variances, and interpret results via p-values.
Analysis of Variance5:27
Master one-way analysis of variance (ANOVA) to compare means of three or more independent groups. Apply hypotheses, p-values, and key assumptions, with a Python f_oneway example.
Chi square test6:10
Explore the chi square test for independence, a non-parametric method for testing relationships between two categorical variables, with hypotheses and p-values, and compute expected counts using Python's chi2_contingency.
Pearson correlation6:36
Explore Pearson's correlation, which quantifies the strength and direction of a linear relationship between two continuous variables using the correlation coefficient r and p-values.
Linear regression analysis14:59
Explore linear regression, predicting a dependent variable from one or more predictors, with intercept and slopes. Learn simple vs multiple models, key assumptions, residuals, and R squared for model fit.
How to generate new feature?8:33
Learn to generate new features from existing data to boost model performance, capturing non-linear patterns through transformations, ratios, aggregations, date decompositions, interactions, and categorical encodings like frequency and target encoding.
Extracting date elements5:50
Extract date elements to unlock time-based patterns and boost model accuracy by decomposing dates into year, month, day, weekday, quarter, and hour for revealing seasonality and cycles.
When to encode feature5:10
Explore feature encoding as a crucial step that converts categorical variables into numeric form for machine learning, using label, one hot, ordinal, and binary encodings.
When to bin feature7:54
Explore feature binding, or discretization, by grouping continuous values into bins, including equal width, equal frequency, or custom binding, to simplify data, capture non-linear patterns, and improve robustness and interpretability.
When to map feature6:02
Explore feature mapping to transform raw features into model-friendly representations, speeding learning, improving accuracy, and enabling complex relationships through value mapping, ordinal encoding, polynomial features, interaction mapping, and domain-based mappings.
When to generate dummies8:03
Convert categorical features to numbers with dummy variables through one-hot encoding, enabling models to learn from presence or absence of categories while dropping a baseline to avoid multicollinearity.
Feature selection5:21
Define the target variable and select relevant features using domain knowledge and statistical tests. Avoid leakage and multicollinearity for robust, accurate predictions.
Methods of Feature scaling9:21
Explore feature scaling with min max scalar and standard scaler to ensure fair feature contribution, accelerate gradient descent, and improve distance-based models like KNN and SVM.
What is Dimensionality reduction?8:53
PCA reduces dimensionality by transforming many variables into uncorrelated principal components that capture the most variance. It standardizes data, computes covariance, derives eigenvalues and eigenvectors, and selects top components.
Splitting Dataset5:17
Master the train test split to separate data into training and testing sets, apply split ratios and a random state, ensure generalization to unseen data, and guard against data leakage.

Datasets for this section0:14
Python for descriptive analysis1:35
Apply Python to compute descriptive statistics for numeric variables using the describe method, obtaining mean, standard deviation, variance, min, max, and percentiles.
Python for Shapiro Wilk test7:10
Learn to perform the Shapiro-Wilk test in Python by importing Shapiro from scipy.stats, assessing normality of numeric variables, and interpreting p-values at 0.05.
Hands-on: Shapiro Wilk test3:46
Apply the Shapiro-Wilk test to numeric columns to assess normality. Interpret p-values for age and average purchase amount, and note normality violations or acceptances.
Normality test
Python for data transformation18:39
Learn how to apply square root, log, and Box-Cox transformations in Python to normalize skewed data, assess normality with the Shapiro-Wilk test, and visualize results with KDE plots.
Hands-on: Data transformation14:01
Apply square root, logarithmic, and box-cox transformations, assess normality with Shapiro-Wilk tests, and visualize results with KDE plots to identify the best method.
SQRT transformation
LOG transformation
BOXCOX transformation
Python for Independent sample t-test7:51
Conduct an independent sample t test in Python to compare average purchase amount between churned and existing customers, using 0.05 significance and SciPy's ttest_ind after filtering churn_status yes or no.
Hands-on: Independent sample t-test4:10
Conduct an independent sample t-test on average purchase amount to compare churned and existing customers using scipy's ttest_ind, interpret the p-value, and note the significant difference with higher churned averages.
Independent sample t-test
Python for Analysis of Variance6:05
Apply one way analysis of variance to compare the average frequency of purchases across cities, test hypotheses with a 0.05 level, and interpret the p value using Python's SciPy f_oneway.
Hands-on: Analysis of Variance8:49
Apply Shapiro-Wilk test to assess normality of frequency of purchases, then perform a one-way ANOVA across Chicago, New York, Houston, and Los Angeles, using Levene’s test to conclude no differences.
Levene's test
Analysis of Variance
Python for Chi square test5:09
Apply the chi-square test for independence to assess the null and alternative hypotheses about region and purchase channel using a cross tab and a 0.05 significance level.
Hands-on: Chi square test2:08
Perform a chi square test for independence using SciPy's chi2_contingency on a cross tab of region and purchase channel, then interpret the p-value to conclude no significant association.
Chi square test
Python for Pearson correlation4:29
Perform a Pearson correlation between purchase frequency and average purchase amount to test for a significant relationship at 5%, after verifying normality and linearity with a scatter plot.
Hands-on: Pearson correlation5:58
Demonstrate a hands-on Pearson correlation to evaluate linearity between purchase frequency and average purchase amount, visualize with rake plot and scatter plots, and report the p value and positive relationship.
Pearson correlation
Python for Linear regression9:27
Apply Python linear regression to measure how frequency of purchases influences the average purchase amount, test hypotheses with a 5% significance level, and report the model summary.
Hands-on: Linear regression5:30
Apply a linear regression with statsmodels to predict the average purchase amount from frequency of purchases, add a constant, and interpret the ols results.
Linear Regression
Python for generating new features5:28
Develop new features through feature engineering using domain knowledge, such as total purchase amount and customer lifetime value, implemented with Python to enrich customer data.
Hands-on: Generating new features2:50
Create a new feature named customer_value from pre-processed data by multiplying frequency of purchases with average purchase amount, then compute customer lifetime value (clv) by lifespan in months, demonstrating execution.
Feature generation
Python for extracting date elements3:13
Extract date elements from a DateTime variable to create features for predictive data modeling. Derive year, month, and day values and add them as columns with Python code.
Hands-on: Extracting date elements4:20
Extract year, month, and day from the date of purchase using Python, ensure datetime64 dtype, handle dot accessor errors, and drop the original date of purchase column in preprocessed data.
Extracting day, month and year
Python for encoding feature4:21
Apply level encoding to convert ordinal categorical variables into numeric features using scikit-learn's level encoder, transforming churn status from yes/no to 1/0 for machine learning models.
Hands-on: Feature encoding1:54
Apply feature encoding to a binary categorical variable using sklearn's label encoder, performing fit_transform on churn status and verifying results in the pre-processed data.
Feature encoding
Python for binning feature4:44
Learn to convert a numeric variable into a categorical feature by binning with pandas pd.cut, creating a new bind column with defined bins and levels, and adjusting include lowest.
Hands-on: Feature binning5:20
Create an engagement_level variable from customer lifespan in months using pd.cut for feature binning, with bins 0–2, 2–3, and 3–5 labeled low, moderately engaged, and highly engaged.
Feature binning
Python for mapping feature1:47
Create a dictionary to map each categorical value to a numeric code, load it as mapping, and apply map to encode an ordinal categorical column.
Hands-on: Feature mapping2:09
Encode engagement levels with a Python dot map by building a string-to-number mapping for low, moderately engaged, and highly engaged, then apply it to pre-processed data and view results.
Feature mapping
Python for generating dummies3:34
Learn how to convert non-ordinal categorical variables into numeric features using pandas get_dummies, and merge the dummy columns into your dataset with pd.concat, for machine learning readiness.
hands-on: Generating dummies2:20
Derive all column names from the preprocessed data and generate dummy variables for gender, city, region, and purchase channel using pandas get_dummies and view first five rows.
Generating dummies
Python for Feature selection2:42
Develop a machine learning model by separating features and the target, loading features into x and the target into y, and removing redundant columns via drop and dummy variables.
Hand-on: Feature selection6:20
Practice feature selection by dropping identifiers and redundant columns from processed data, then prepare regression and classification datasets with x and y targets for CLV and churn status.
Feature selection
Python for scaling features4:57
Scale features with Python using standard scaler and min max scaler from sklearn.preprocessing, applying fit_transform to prepare features for a machine learning model on customer data.
Hands-on: Feature scaling4:54
Scale features for regression and classification models using standard scaler and min max scaler from sklearn.preprocessing, applying fit_transform to x_reg and x_class.
Standard scaling
MinMax Scaling
Python for Dimensionality reduction7:49
Apply PCA using sklearn to reduce feature dimensionality, compute explained variance ratio, and identify the optimal component count via a plotted variance line.
Hands-on: Dimensionality reduction5:08
Explore a hands-on principal component analysis (PCA) with sklearn, computing explained variance ratios and plotting AVR against the number of components to identify a single component that explains all variance.
Explained variance ratio
Select n_component
Principal component analysis
Python for train-test set5:46
Learn to split data into train and test sets with train_test_split, define X and y, and set test_size and random_state for reproducible evaluation on scaled features.
Hands-on: Train-test set3:42
Import train_test_split and create train and test sets for regression and classification models, then scale features with a min-max scaler and set test_size to 0.2 and random_state to 42.
Train test split
Datasets for this phase0:07
Python for Linear regression model6:02
Develop a machine learning linear regression model to predict customer lifetime value by combining statistics and computer vision, using scikit-learn to train, predict, and evaluate with mean squared error.
Hands-on: Linear regression model4:27
Import sklearn's linear regression and mean squared error, fit the model on train features and target, then predict test outcomes; evaluate with MSE and compare via a CDF plot.
Build Linear Regression
Prediction with Linear Regression
Model evaluation
Python for logistic regression7:07
Apply logistic regression to predict churn status from features, trained on x_train and y_train and tested on x_test, with accuracy, confusion matrix evaluation, and heatmap visualization.
Python for logistic regression6:38
apply logistic regression using sklearn to load, train, predict, and evaluate a classification model; compute accuracy and visualize a confusion matrix to interpret performance.
Build Logistic Regression
Evaluate the LGR model
Python for cross-validation6:26
Apply k-fold cross-validation in Python using stratified k-fold and cross_val_score with logistic regression, set max_eta to 1000, n_splits to 5, and random_state 42 for sports data.
Hands-on: k-fold cross validation5:19
Apply k-fold cross validation to a logistic regression model on sports data using Python. Include data preprocessing with standard scaler, feature-target split, and stratified k-fold scoring to gauge accuracy.
K-fold cross validation
Python for regularization3:57
Learn to apply L1 and L2 regularization to regression with Python using scikit-learn's lasso and ridge models. Configure alpha and max_iter, then fit, predict, and evaluate with MSE and RMSE.
Hands-on: Model regularization5:26
Apply L1 and L2 regularization to a linear regression model, compare lasso and ridge using mean squared error (MSE), and perform hyperparameter tuning.
L1 & L2 regularization
Python for oversampling methods2:16
Learn to apply smote-based oversampling on imbalanced data in python using the EMB learn oversampling module, training on xtrain with random_state 42 and printing results with a counter function.
Hands-on: oversampling methods3:55
Apply smote oversampling to balance imbalanced data, inspect class distribution with a counter, and compare distributions before and after resampling.
SMOTE - oversampling
Python for undersampling methods3:29
Apply Tomek links undersampling to imbalanced data, then optionally combine with SMOTE using the imblearn motomagx tool to balance classes and inspect with a class counter.
Hands-on: Undersampling methods4:26
Balance imbalanced data by applying Tomek links undersampling and a smote-tomek combination, using atomic links for balanced fraud detection.
Tomek Links - Undersampling
Python for KMeans clustering10:47
Learn to perform k-means clustering in Python, using scikit-learn to segment customers and apply the elbow method with wcss to choose the optimal number of clusters.
Hands-on: KMeans clustering12:11
Perform k-means clustering on customer data using recency, frequency, and monetary score; use the elbow method to choose k, then label clusters as regular and loyal for targeted segmentation.
Calculating WCSS
Plotting Elbow chart
Building KMeans cluster
Python for decision tree regression3:26
Learn to use the decision tree regressor to predict customer lifetime value, training with fit, predicting with predict, and evaluating with mean squared error in scikit-learn Python.
Hands-on: Decision tree regression3:50
Build a decision tree regressor with sklearn, train on x_rate_train and y_rate_train, predict on x_rate_test, and assess with mean squared error and plots comparing predicted to actual values.
Decision tree regression
Python for Decision tree classification2:54
Build a decision tree classifier to predict customer churn, train with xtrain and ytrain, and evaluate using accuracy and confusion matrix, then visualize results with a heat map.
Hands-on: Decision tree classification5:20
Explore building a decision tree classification model using sklearn, train on training data, predict test data, and evaluate with accuracy score and confusion matrix, comparing results to logistic regression.
Decision tree classification
Python for Random forest regression3:37
Apply the random forest regressor, an ensemble of decision trees, in Python with sklearn to train, predict, and evaluate customer lifetime value using mean squared error.
Hands-on: Random forest regression4:21
Train a random forest regression model with sklearn, fit on x_train and y_train, predict on x_test, and compute mean squared error, then compare with linear regression for customer lifetime value.
Random forest regression
Python for Random forest classification2:45
Learn to build a random forest classification model in Python to predict customer churn, import RandomForestClassifier from sklearn.ensemble, fit on features and target, and evaluate with accuracy and confusion matrix.
Hands-on: Random forest classification3:09
Import and train a random forest classifier from sklearn to predict customer churn, evaluate accuracy score and confusion matrix, and compare to logistic regression, achieving about 85% accuracy.
Random forest classification
Python for AdaBoost Models6:49
Learn to implement AdaBoost classification and regression in Python using scikit-learn, including model setup, training with 100 estimators and random state, and evaluation with classification report and RMSE for regression.
Hands-on: AdaBoost Models11:35
Learn to build AdaBoost classification and regression models in Python on Google Colab, including data loading, target encoding, feature scaling, and evaluation with the classification report and mean squared error.
AdaBoost Classification Model
Python for Traditional GBM Model4:26
Explore traditional gradient boosting with scikit-learn by building a classifier and regressor, tuning nestimators and learning rate, fitting on train data, and evaluating with a classification report and rmse.
Hands-on: Traditional GBM Model7:07
Develop a traditional gbm classification and regression pipeline on healthcare data, performing preprocessing, train-test split, feature scaling, and evaluation with the classification report and rmse.
Trad GBM Classification Model
Python for CatBoost Models4:51
Explore Python code for CatBoost models, including classification and regression, with import, fit, predict, evaluation via classification report and rmse, and mindful use of verbose and random state.
Hands-on: CatBoost Models9:01
Practically build and evaluate CatBoost classification and regression models in Python, including preprocessing, train-test split, feature scaling, model training, and performance metrics like MSE and RMSE.
CatBoost Classification Model
Python for LightGBM Models6:03
Develop gbm classifier and regressor using Lightgbm, evaluate with the classification report and rmse, and prepare for hyperparameter tuning with XGBoost in upcoming lessons.
Hands-on: LightGBM Models6:53
Develop and evaluate Lightgbm classification and regression models on healthcare data through end-to-end preprocessing, feature scaling, and train-test splits, using classification reports and RMSE for assessment.
LightGBM Classifier
Python for XGBoost Models3:02
Develop XGBoost classification and regression models in Python, train with parameters like n_estimators and learning rate, and evaluate using the classification report and rmse on healthcare data.
Hands-on: XGBoost Models7:04
Develop an XGBoost classification and regression model on healthcare data, covering preprocessing, train-test split, scaling, and evaluation with classification reports and RMSE.
XGBoost Models
Python for Hyperparameter tuning10:18
Leverage Python and Bayesian search CV to tune Xgbregressor hyperparameters, defining a search space for n_estimators, max_depth, learning_rate, subsample, and gamma, using RMSE as the optimization metric in 3-fold cross-validation.
Hands-on: Hyperparameter tuning10:54
Hyperparameter tuning of an xgbregressor for heart rate prediction on healthcare data using Bayesian search with scikit-optimize to minimize RMSE and MSE.
Bayes search CV
Deep Learning - The data10:19
Apply TensorFlow deep learning to the mnist dataset, using 28 by 28 grayscale images (784 pixels) to predict digits 0 to 9 with a softmax probability vector.
Deep Learning - Data Processing8:42
Load and preprocess the mnist data in TensorFlow, including creating train and test splits, normalizing pixel values by 255, and reshaping images to 784-length vectors for model input.
Deep Learning - Model training16:43
Develop a TensorFlow deep learning model with tf.keras sequential layers, 784 input shape, dense units (128, 64, 10) with sigmoid activations, softmax output for MNIST classification, trained with SGD.
Deep Learning - Model evaluation1:44
Evaluate a TensorFlow deep learning model on test data using test loss and test accuracy, confirming strong generalization with test accuracy around 93% and training accuracy around 92%, avoiding overfitting.
Solve Real problem with Deep Learning

PROJECT 1: Gen-AI Image Captioning38:54
Create a generative AI image captioning tool that fuses vision and language using cnn s and transformers. Build an end-to-end system with blip models and gradio to generate captions.
PROJECT 2: Gen-AI Chatbot1:18:44
Explore generative ai powered chatbot design using transformer models like llama and gamma, guided by NLP and self-attention, with applications in customer service, content creation, coding, and education.
PROJECT 3: Gen-AI Voice Assistant52:57
Explore how generative ai voice assistants combine multimodal inputs, large language models, and speech technologies to generate text and speech responses through a Gradio interface.
PROJECT 4: Gen-AI Text to Image36:03
Explore generative AI text-to-image creation using stable diffusion 1.5, latent diffusion models, and the diffusers library, with a Gradio interface to transform text prompts into high-quality images.
PROJECT 5: Gen-AI Video Summarizer1:13:35
Explore generative AI video summarization by transcribing YouTube video audio with Whisper and summarizing the transcript with BART large CNN, delivering concise summaries via a Gradio interface.
PROJECT 6: Gen-AI Language Translator35:50
Discover how generative AI language translation uses neural machine translation to deliver fluent, context-aware translations across 100 plus languages, powered by Facebook's MQM 101.2 model with a Gradio interface.
PROJECT 7: Gen-AI Data Analyst1:03:11
Explore generative AI data analysis with a gen-ai data analyst that automates data analysis, extracts insights, and enhances decision making using NLP, zero-shot classification, descriptive statistics, correlation, and regression.

Requirements

Access to computer and internet
Basic computer literacy
No coding experience required
Dedication, patience and perseverance

Description

Embark on a transformative journey into the world of Data Analytics, Data Science, and Machine Learning, where you’ll learn the essential skills, tools, and mindsets to become a successful data professional. This comprehensive program is designed to take you from beginner to advanced, equipping you with the knowledge and practical experience needed to excel in the field.

Whether you’re looking to kickstart a career in data analytics or enhance your existing skills, this course will empower you to succeed in the dynamic world of data. Join us on this exciting path and unlock your potential in just 60–100 days of disciplined learning.

Why This Course Matters

Most learners struggle with fragmented resources, inconsistent guidance, or theory-heavy content that doesn’t build real competence. This course solves that problem. It’s structured to provide step-by-step, cumulative, and daily progress — helping you turn knowledge into capability, and capability into career readiness.

We are in the AI revolution, and every industry is transforming with tools like ChatGPT, Stable Diffusion, and AI copilots for writing, coding, design, analytics, and more. This course ensures you don’t just learn theory — you’ll build real-world solutions that make you job-ready.

1. Foundations of Data Analytics, Data Science & Python

Learn how to think like a data scientist, not just how to write code.
Python fundamentals: variables, loops, conditionals, functions, data structures.
Clean, modular, reusable coding practices for data workflows.
Importing and handling real-world datasets with Pandas and NumPy.
Data types, memory optimization, and performance tuning.
A-Z data cleaning and manipulation techniques: sorting, filtering, pivot tables, and charts.

2. Excel, SQL, Python & Power BI Proficiency

Excel: Manipulate data, perform calculations, and create visualizations.
SQL: Query and manipulate relational databases, perform joins, aggregations, and optimize queries.
Python: Analyze and visualize data with Pandas, NumPy, and Matplotlib. Automate workflows and create advanced dashboards.
ChatGPT for Data Analysis: Handle missing data, outliers, dataset merging, pivoting, and even advanced ML predictions.
Power BI: Connect to multiple data sources, clean and transform data, and design interactive dashboards and reports.

3. Exploratory Data Analysis (EDA)

Understand the shape, distributions, and essence of raw data.
Advanced grouping, filtering, and reshaping with Pandas.
Visualize relationships using Matplotlib and Seaborn (histograms, pairplots, heatmaps).
Develop strong data intuition and hypothesis-forming skills.

4. Probability, Statistics & Mathematics for Data Science

Probability distributions: Normal, Binomial, Poisson, Exponential, Uniform.
Descriptive statistics: mean, median, mode, variance, standard deviation.
Inferential statistics: confidence intervals, hypothesis testing, chi-square, t-tests, ANOVA.
Linear Algebra: vectors, matrices, dot products, PCA foundations.
Calculus: derivatives, gradients, optimization, and gradient descent for ML.

5. Machine Learning & Feature Engineering

Complete ML workflow: preprocessing, training, validating, testing.
Algorithms: Logistic Regression, Decision Trees, Random Forests, KNN, Ensemble Methods.
Handling class imbalance (SMOTE, stratified sampling).
Model evaluation: accuracy, precision, recall, F1-score, ROC-AUC.
Bias-variance tradeoff, underfitting vs. overfitting.
Feature engineering: encoding categorical variables, scaling/normalizing, building pipelines.
Hyperparameter tuning (GridSearchCV, RandomizedSearchCV).

6. Deep Learning & Generative AI

Neural networks with TensorFlow: tensors, activation functions, backpropagation, optimizers.
Build and train models step by step, fine-tune, and evaluate with accuracy/loss metrics.
Prompt Engineering: Chain-of-Thought, Tree-of-Thought, structured prompts.
Generative AI Tools & Use Cases: text, image, code, audio, and video generation.
Real-world AI applications: chatbots, translators, voice assistants, text-to-image, video summarization.

7. Projects & Hands-On Practice

Over 30+ assignments, 120+ coding exercises, and 10 quizzes.
Capstone Projects:
- Bank Data Analysis
- Sports Data Analysis
- Fraud Detection & Classification
- Striker Ranking (End-to-End ML Deployment)
Generative AI Projects (7 full-scale builds):
- Image Captioning AI
- Chatbot with LLaMA2/Gemma
- AI Voice Assistant
- Text-to-Image Generator
- AI Video Summarizer
- Language Translator
- AI Data Analyst

Benefits of the Course

Career Readiness: Gain the technical and professional skills to qualify for data analyst and data scientist roles.
Versatility: Become proficient in Excel, SQL, Python, Power BI, TensorFlow, Hugging Face, and more.
Problem-Solving Skills: Sharpen your analytical and critical thinking abilities.
Portfolio Enhancement: Build a robust portfolio of real-world projects to showcase in interviews.
Industry-Relevant Learning: Stay up-to-date with modern data and AI methodologies.

How This Course Will Transform You

By following this structured roadmap, you’ll be able to:

Confidently work with real datasets and perform independent analysis.
Build, tune, and deploy machine learning and AI models.
Understand the mathematical foundations of modern data science.
Create a project portfolio strong enough for job interviews or freelance opportunities.
Qualify for entry-to-intermediate level roles in Data Science, ML Engineering, or Analytics.

One Honest Limitation

This course is not for learners who prefer highly animated, passive learning. The teaching style is text-based, code-first, and explanation-rich — emphasizing depth, clarity, and practical application. Diagrams and visuals are included, but the focus is on doing, thinking, and building.

Who this course is for:

Everyone!

What you'll learn

Explore related topics

Coding Exercises

Course content

Introduction to Data Analysis6 lectures • 1hr 16min

Python - Foundations for Programming39 lectures • 5hr 41min

Python - Data Cleaning & Exploratory Data Analysis (EDA)45 lectures • 6hr 1min

Introduction to Data Science25 lectures • 4hr 11min

Hypothesis Testing in Statistics Explained25 lectures • 2hr 25min

Machine Learning, Deep Learning & AI75 lectures • 9hr 20min

Probability & Distribution for Data Analytics29 lectures • 3hr 13min

Statistics & Data Preprocessing Methods Explained37 lectures • 4hr 43min

Python - Complete Data Science, Machine & Deep Learning75 lectures • 7hr 6min

Python - Developing AI Projects7 lectures • 6hr 19min

Requirements

Description

Who this course is for: