Name: The Complete Full Stack AI Engineering Bootcamp
Rating: 4.4 (40 reviews)

Udemy Business

Teach on Udemy

Turn what you know into an opportunity and reach millions around the world.

Learn More

Your cart is empty.

Keep shopping

Created byAritra Basak

Last updated 5/2026

English

What you'll learn

Build end to end AI Engineering projects using Python, PyTorch, scikit-learn, and SQL from data processing to model deployment.
Master Natural Language Processing (NLP) and Transformers by implementing real projects with Hugging Face, BERT, T5, and Large Language Models (LLMs).
Develop production ready AI APIs using FastAPI and Docker for scalable model deployment.
Understand and implement LangChain and LangGraph to build multi agent LLM applications with memory, tools, and workflows.
Learn Model Context Protocol (MCP) and create MCP servers and clients for advanced AI tool integration.
Perform data analysis, visualization, and feature engineering using Matplotlib and scikit-learn for machine learning pipelines.
Design AI systems with context engineering, prompt engineering, RAG, and memory management.
Gain practical skills required for AI Engineer, NLP Engineer, and LLM Engineer roles in the industry.
Run open source LLMs locally using Ollama
Integrate Ollama models with Python and LangChain to build local AI applications
Design scalable data pipelines using Azure Databricks, PySpark, and Delta Lake for real-world AI applications
Apply MLflow for experiment tracking, model management, and building reproducible AI and LLM workflows

Coding Exercises

This course includes our updated coding exercises so you can practice your skills as you learn.

Course content

28 sections • 338 lectures • 55h 44m total length

What is Machine Learning7:21
Explain why and when to use machine learning, contrasting a keyword-based spam pipeline with a data-driven model. Train on large data, deploy in production, and retrain regularly to adapt.
AI vs ML vs DL2:42
Explore the differences between ai, ml, dl, and generative ai, from expert systems to models that handle unstructured data and generate synthetic content.
Machine Learning Types8:47
Explore supervised, unsupervised, semi-supervised, self-supervised, and reinforcement learning, with labeled data, regression, classification, clustering, anomaly detection, and agent–environment dynamics.
Download Resources0:06
Quiz

Download Resources0:01
How Python Works15:18
Discover how Python operates as a procedural, object oriented, and functional language with dynamic typing and automatic memory management, and learn execution flow from lexical analysis to the virtual machine.
Python Installation1:43
Install python 3.13.7 and Visual Studio Code, then set up your development environment for machine learning on Windows or Mac.
Variables in Python11:32
Learn python fundamentals by building a number-guessing game that uses variables, user input, type conversion to int, print, and memory concepts like id addresses within a VS Code workflow.
Loops in Python8:35
Learn how to structure a Python game with a while loop and conditions to run for five rounds, validate input (1-9), convert strings to integers, and increment rounds.
Data Structure in Python16:53
Explore Python data structures—list, tuple, set, and dictionary—highlighting mutability, uniqueness, and common operations like append, insert, and pop, then apply them to manage users and a scoreboard in a game.
How Import Works in Python7:06
Learn to import the random module in Python and use randint to generate numbers for a guessing game. Update a per-user scoreboard dictionary on matches and print the final scoreboard.
Methods in Python6:10
Learn to refactor a simple Python game by creating methods with def, such as generateRandomNumber and VerifyNumber, passing randomNumber and user input to improve readability, maintainability, and proper scope.
Python Exception Handling6:59
Explore Python exception handling using try and except and finally to manage runtime errors such as division by zero, and ensure the code keeps running.
Object Oriented Programming in Python29:39
Explains object oriented programming in Python by modeling real world entities as classes and objects, and demonstrates encapsulation, inheritance, polymorphism, and abstraction with practical code examples.
List Comprehension in Python8:35
Master list comprehension and dictionary comprehension in Python for machine learning, including filtering evens, flattening a 2d array to a 1d array, and selecting grades above 80.
File Handling in Python17:27
Learn how to load and preprocess data in Python using file handling, including reading, writing, and appending text files, plus copying binary files like bird.png in practical data science workflows.
Generator and Decorator in Python13:46
Explore how Python decorators wrap functions to inject behavior, such as generating random numbers or validating JWT tokens, and how generators use yield to process data efficiently in web apps.
Python Multiprocessing9:09
Explore python multiprocessing to accelerate cpu intensive tasks by using cores and threads, overcome the GIL with pool.map, and fetch headlines from multiple sites in parallel using request and BeautifulSoup.
Python Asyncio21:55
Learn how Python asyncio enables non-blocking io for external calls using async and await, and how to run coroutines on the event loop for concurrent tasks with aiohttp.
Virtual Environment Setup in Python14:18
Learn to create and activate a Python virtual environment to isolate Flask dependencies and manage them with pip and a requirements.txt for consistent GitHub sharing.
Python Lambda Function3:16
Learn Python lambda functions, including anonymous functions defined with the lambda keyword, with examples of addition, conditional grading, and nested lambdas that multiply numbers.
Python Map, Filter & Reduce4:25
Explore Python's filter, map, and reduce process lists: use filter with a lambda to extract evens, map to transform values, and reduce to a single result with functools.
Access Course Code: NumPy, Pandas, Matplotlib & Seaborn0:03
NumPy Fundamentals in Python33:56
Delve into NumPy fundamentals in Python as part of the complete full stack AI engineering bootcamp.
Pandas Fundamentals in Python21:47
Explore pandas fundamentals in python by learning to create and manipulate data frames, perform filtering and updates, and compute aggregations with practical examples.
Matplotlib Fundamentals in Python23:55
Develop hands-on Matplotlib fundamentals in Python by building line plots, multiple lines with legend, scatter plots, bar charts, and histograms using NumPy for synthetic data.
Seaborn Fundamentals in Python12:56
Master seaborn fundamentals in Python, built on matplotlib, by creating line, scatter, histogram with kde, box, and bar plots. Use synthetic data and themes to visualize relationships and distributions.

What is Vector8:57
Explore vectors across 2d, 3d, and higher dimensions, understanding magnitude and direction versus scalars. Learn how vectors model data features and measure similarity in machine learning, including housing price datasets.
Different Types of Vectors4:16
Explore how row vectors, column vectors, and matrices differ in shape and orientation, with 1x5 and 5x1 representations and the transition to a matrix when adding a second column.
Vector Addition Operation2:40
Explore vector addition and subtraction in 2d using p vector and q vector with coordinates a1,b1 and a2,b2, and visualize the resultant from origin to the sum's tip.
What is Vector Span3:40
Explore vector span as the reach of a set of vectors in a space, describing how scalar stretching and vector addition generate all linear combinations a v1 plus b v2.
Vector Multiplication11:12
Explore dot and cross products in vector multiplication; the dot product yields a scalar via magnitude and cos theta, while the cross product yields a perpendicular vector.
Vector Intuition for Machine Learning Project8:26
Represent a housing price dataset with size and bedrooms as vectors in a matrix, then apply dot products and matrix multiplication with a weight vector to predict price.
What is Derivative8:34
Explore the derivative concept, its formula, and how it measures a function's rate of change and slope, with implications for gradient descent in machine learning.
What is Partial Derivative3:21
Partial derivatives extend one-variable derivatives to multiple variables, measuring how a function changes with respect to one variable while the others stay constant. For f(x,y)=x^2+xy+y^2, ∂f/∂x=2x+y and ∂f/∂y=x+2y.
Intuition of Gradients3:47
Define the gradient as the vector of all partial derivatives, capturing slope for multi-variable functions, such as f(x, y) = 3x^2 + 2xy + 2y^2.
Chain Rule of Derivative2:26
Master the chain rule by differentiating a composite function, identifying inner and outer functions, and applying du/dx to obtain the derivative of (3x+2)^5 as 15(3x+2)^4.
Quiz

Introduction to Statistics3:02
Explore how statistics underpins machine learning by enabling descriptive analysis, inferential reasoning, feature engineering, probability theory, and model evaluation to understand data and make predictions.
Level of Measurement in Statistics5:36
Explore descriptive statistics and how to categorize data by level of measurement. Distinguish qualitative data into nominal and ordinal, and quantitative data into interval and ratio with practical examples.
Mean, Median & Mode7:00
Learn how mean, median, and mode summarize data, using classroom scores to illustrate when to use each measure, including how outliers affect the mean and why median or mode matter.
Probability Distribution Function in Statistics7:53
Discover how the probability distribution function models the relationship between outcomes and probabilities for a random variable, using coin and dice examples, and learn to visualize with a pdf.
What is Probability Mass Function7:38
Master the pmf for discrete variables, the pdf for continuous variables, and the cdf's up-to-a-value concept, illustrated with dice, coin tosses, and cgpa examples.
Small Correction in PMF0:58
Clarify that the probability mass function for a discrete random variable plots outcomes on the x-axis (1–6) with their probabilities on the y-axis, not a probability distribution.
Probability Density Function in Statistics5:39
Learn how the probability density function for a continuous variable, like cgpa from 0 to 10, uses area under the curve to compute probabilities for a given range.
What is Normal Distribution11:54
Explore the normal distribution, or bell curve, where mean, median, and mode center the data; learn about standard deviation and the empirical rule for data spread and outliers.
Population and Sample in Statistics3:07
Learn how population and sample work in a state blood pressure study, with 10 million people and 5,000 representative samples. Apply statistical tests to infer the total population.
Common Statistical Terms You Should Know9:34
Z-Scores in Statistics9:36
Understand how z-scores measure how far a data point is from the mean in a Gaussian distribution, using mean and standard deviation, and standardization for feature engineering.
Non-Gaussian Distribution Explained21:04
Analyze non-gaussian distributions, including right and left skew, bimodal and multimodal patterns, uniform and heavy-tailed shapes, with examples like income, retirement age, and stock returns.
Binomial & Poisson Distribution13:52
Explore binomial distribution with two-outcome, fixed trials, computing exact probabilities like four heads in ten flips, and apply Poisson distribution to model counts in a fixed interval.
What is Central Limit Theorem2:19
The central limit theorem states that the sampling distribution of the mean becomes normal regardless of population distribution with samples of 30 or more, enabling z-score and confidence intervals.
Confidence Interval in Statistics5:35
Explore how a point estimate and margin of error form a 95% confidence interval that indicates where the parameter lies, illustrated with a mango juice sampling analogy.
Hypothesis Testing15:37
Learn hypothesis testing by formulating null and alternative hypotheses, using sample data to infer population means, and applying z-scores, confidence levels, and p-values to reject or fail to reject.
T-Test and ANOVA13:07
Explore anova and t-test as hypothesis tests for feature selection in machine learning, focusing on categorical features with two or more categories, and include chi-square and z-test when applicable.
Quiz

Introduction to Data Visualization2:24
Data visualization uncovers patterns in a housing price data set of 10,000 records and 20 features, guiding actions through Python tools like seaborn, matplotlib, pandas, and numpy.
Project Setup3:58
Create and activate a dv python virtual environment, install numpy, pandas, matplotlib, C bond package, and ipy kernel via requirements.txt, and start a project notebook for data visualization.
Bar Chart Explained6:57
Learn how to construct a bar chart from a frequency distribution by dividing data into five age intervals, plotting frequencies with Matplotlib, and interpreting which ranges dominate.
Downloading Datasets from Kaggle5:15
Download a housing price dataset from Kaggle, save it as csv in a data folder, then load it in Python as a pandas data frame and inspect with head.
Mean, Median and Mode3:18
Learn to compute mean, median, and mode from a data frame, explore standard deviation, and use describe to summarize numerical columns.
Histogram and KDE Plot7:49
Explore visualizing probability density functions and data distributions with kernel density estimate (kde) plots and histograms using sns and matplotlib, comparing area, bedrooms, parking, and price distributions for housing data.
Scatter Plot4:40
Visualize the relationship between area and price with a scatter plot and explore how bedrooms affect price. Use hue for main road and identify simple outliers.
Visualizing Relationship with Pairplots3:45
Extract numerical columns and compute their correlations, then visualize them with a Seaborn pairplot to show how price relates to area, bedrooms, bathrooms, stories, and parking.
Bar Plot Explained3:12
use a bar plot to show how discrete numerical values like bedrooms affect price, noting that four and five bedrooms may be higher, while area and other factors matter.
Visualizing Categorical Data with Count Plots2:32
Visualize the distribution of categorical features with a count plot using seaborn, exploring main road and guest room variables and optional hue for deeper insight.
Finding Patterns in a Dataset Using Data Visualization8:16
Load dataset into a Python notebook with pandas, create a data frame, compute mean, median, standard deviation, and correlations, then plot scatterplots, barplots, and kde to uncover patterns.
Data Visualization
Data Visualization for Categorical Data

Introduction to Feature Engineering9:12
Learn how feature engineering cleans noisy data, handles missing values and categorical data, detects outliers, and scales features, then transforms, constructs, selects, and extracts features to boost model performance.
What is Standardization10:41
Apply standardization as part of feature scaling before modeling. Compute per-column mean and standard deviation, then transform values with (x - mu)/sigma to achieve zero mean and unit variance.
What is Normalization4:41
Normalization shows the formula x minus xmin over xmax minus xmin, scales data to zero to one (or minus one to one), and addresses outliers with median and iqr.
Ordinal Encoding Explained4:41
Encode ordinal data by ranking categories and converting education levels into numerical values, distinguishing ordinal from nominal data, and applying ordinal encoding with scikit-learn on a student dataset.
One Hot Encoding (OHE) Explained10:05
Master one hot encoding for nominal data, contrast with ordinal encoding for education, and learn to avoid dummy variable trap by dimensionality reduction before applying scikit-learn.
What is Variable Transformation7:36
Apply variable transformation to convert non-normal column distributions into normal distributions. Use Gaussian, logarithmic, reciprocal, square root, exponential, Box-Cox, and Johnson transformations, illustrated with salary data.
How to Handle Mixed Variable in a Dataset3:43
Split the mixed variable odc details into categorical and numerical columns, then apply one-hot encoding to the categorical part to prepare an employee details dataset for modeling.
How to Handle Missing Data in a Dataset18:33
Learn how to handle missing data by removing rows under mcar and low missing percent, and imputing with univariate mean or median, or with arbitrary value and end-of-distribution methods.
KNN Imputation11:25
KNN imputation uses other columns to fill missing values, a multivariate approach that preserves correlations. It measures row distances and fills with the average of the k nearest neighbors.
KNN Imputation
MICE Imputation10:17
Learn how mice imputation handles missing at random data by replacing missing values with mean, then iteratively predicting with a model using related features, and averaging the results.
How to Detect Outliers in a Dataset10:21
Learn how to detect outliers using z-score for normally distributed data and iqr for skewed data, and apply trimming or capping using mu plus or minus three standard deviations.
Detect Outliers
Feature Construction Explained2:13
Master feature construction to create new features from existing data to improve model performance, using domain knowledge and intuition, illustrated by deriving family type from sibling and parent features.

Introduction to EDA13:22
Define the scope of a data science project, gather and clean data, perform exploratory data analysis, feature engineering, train and deploy the model, then evaluate its production performance.
EDA vs Feature Engineering6:44
Clarify that cleaning data is separate from feature engineering, fix missing values and typos, adjust data types, remove duplicates, then perform exploratory data analysis and prepare the data for modeling.
Project Setup4:02
Set up a data science project with a virtual environment named eda-demo, load streaming.csv from Kaggle, and install pandas, numpy, matplotlib, seaborn, scikit-learn, and ipykernel for Jupyter in VS Code.
Exploring the Dataset7:56
Analyze customer churn in a video streaming platform with exploratory data analysis, documenting project scope and data requirements, highlighting age, gender, region, and subscription status.
Changing Data Types in a Dataset5:11
Drop nonessential columns, inspect data types with df.dtypes, and cast features—e.g., convert age from float64 to float32—to save memory while noting precision trade-offs.
How to Perform Imputation on a Dataset11:32
Identify missing values with df.info and df.isna, then compare MCAR thresholds and imputation techniques—mean, median, KNN, and MICE—highlighting age’s multi-modal distribution and imputation effects.
Imputation on Discreate Features4:28
Learn to impute missing values in a discrete satisfaction score column using mode imputation with scikit-learn's SimpleImputer, visualize distributions with count plots, and verify zero missing values.
Data Cleaning in EDA8:51
Clean data in EDA by removing typos and duplicates, checking for inconsistencies, and replacing negative monthly spend with the minimum positive value (not imputation).
Detecting Outliers in a Dataset12:52
Identify and remove outliers in continuous numerical columns, using statistics and KDE plots to assess distributions of discount offered and monthly spend, then apply z-score based filtering.
Data Analysis17:14
Conduct data analysis on churn, discount, satisfaction scores, support tickets, and region to reveal that discount is not the main driver, while low satisfaction drives cancellations.
Feature Engineering in a Dataset5:44
Perform feature engineering by converting categorical columns to numerical values, scaling numerical features, dropping the customer ID, and applying a 0.2 train-test split with a pipeline.
Data Leakage Explained6:29
Learn how data leakage occurs in a data science pipeline and how a proper train-test split before outlier detection prevents leakage, ensuring robust model training.

Introduction to RDBMS and SQL7:49
Explore how SQL fuels relational databases, using PostgreSQL as the focus, to store and query credentials, products, and purchases through tables and their relationships.
PostgreSQL Installation on Windows3:39
Download and install PostgreSQL on Windows with pgAdmin, create a SQL demo database in pgAdmin, and prepare to write SQL queries in the next video.
Create Our First Table in PostgreSQL4:12
Create a pgsql database and a cities table with name, country, population, and area, using varchar(50) and integer data types, in the complete full stack ai engineering bootcamp.
Insert Records in PostgreSQL4:30
Learn how to insert records into the cities table using insert into values, specify column order, and verify results with select from cities and where clauses.
SQL Update Query1:41
Apply the update keyword to modify a record in the cities table by setting the population to 5 where the name equals Kochi.
SQL Delete Query1:10
Master deleting a specific record with sql by using delete from cities where name = Kochi, then verify the deletion with select * from cities.
Drop vs Truncate Command in SQL3:35
Compare drop and truncate in SQL to understand table deletion commands. Drop removes the table and its structure, while truncate deletes all rows but keeps the table structure.
SQL Alter Keyword1:51
Learn how to use the SQL alter keyword to modify the products table by adding a new column, adjusting data types and constraints such as unique, and changing column length.
Master SQL Commands: DDL, DML, DQL & DCL3:01
Master the four SQL command families—DDL (data definition language: create, drop, alter), DML (data manipulation: insert, delete, update), DQL (data query: select), and DCL (data control: grant, revoke).
How SQL Executes a Query8:16
Discover how SQL performs math operations to compute population density by dividing population by area, aliasing as population density, and filtering results with a where clause, plus execution flow insight.
Database Design Strategy10:21
Design a database by defining tables, columns, and relationships using a practical Instagram example, detailing users, posts, comments, and likes with one-to-many and many-to-one links.
Understanding Primary Key and Foreign Key in SQL7:10
Create users and post tables with serial ids as primary keys, and include username, email, and url. Establish a one to many relation by adding a foreign key referencing users.id.
What is Data Consistency in SQL9:31
Explore how data consistency is maintained in SQL by using foreign keys between users and posts, and by applying on delete actions like cascade, set null, and set default.
SQL Joins Explained: Inner, Left, Right and Full15:01
Explore sql joins with practical examples using the users and post tables, covering inner, left, right, and full joins, and discover how user_id relates to id.
What is GROUP BY in SQL7:22
Learn how to use group by to aggregate posts by user id, count records per user, and apply having to filter results in SQL queries.
Understanding Subqueries in SQL5:16
Learn how subqueries work by computing the average price per manufacturer with an inner query and a group by on mobile_records, then the outer query with max finds highest average.
Understanding Database Normalization14:48
Explore database normalization, covering first, second, and third normal forms, functional dependency, candidate keys, non-prime attributes, and the criteria behind BCNF design.
Mastering Views in SQL5:07
Discover how views simplify complex sql queries by creating a virtual table from multiple tables, such as users and posts, with examples of create, update, limit, and drop operations.
Materialized Views in SQL6:12
Explore materialized views that cache expensive join results across multiple tables to speed SQL queries and learn how to refresh the cache when data changes.
Concurrency in SQL: Locks, Isolation and Transactions4:58
Explore how concurrency is managed in sql transactions by using begin and commit to lock rows, illustrating isolation and preventing overbooking when two users book tickets.
Understanding Array and JSON Data Types in SQL5:44
Create a student table with an integer array for marks and a jsonb address, insert records, and query array elements and json fields to extract the street.
Mastering Stored Procedures in SQL17:36
Learn to create and call PostgreSQL stored procedures in plpgsql, update the sales and products tables, and enforce stock checks with conditional raises and foreign keys.
Mastering Window Functions in SQL4:10
Explore sql window functions to calculate values across sets of rows, using over and partition by to compute maximum salaries by department within the employees table.
Understanding ACID Properties in PostgreSQL7:30
Explore atomicity, consistency, isolation, and durability in PostgreSQL and other RDBMS with transactional examples. Learn how write-ahead logs ensure durability and recovery after crashes.
Quiz

First Step Before Any Machine Learning Project13:57
Explore how business context guides machine learning for real estate pricing, from data collection in databases, cloud, files, apis, and web scraping to supervised training, feature engineering, and api deployment.
Simple Linear Regression Intuition8:11
Explore simple linear regression with one input and one output, using housing data to fit the best line by updating theta0 and theta1 for y = theta0 + theta1 x.
What is Cost Function6:01
Explain the cost function in simple linear regression, showing how to fit the best line using the hypothesis h_theta(x) and minimize the mean square error J(theta0, theta1) across data points.
What is Gradient Descent9:52
Explore gradient descent for linear regression, minimizing the mean square error by updating theta0 and theta1 toward the global minimum using the convergence algorithm and learning rate alpha.
Learning Rate Intuition5:41
Learn how the learning rate alpha controls gradient descent toward the global minima. The slope guides adjusting theta j, and a 0.5 stepping analogy shows why a learning rate matters.
Mathematical Intuition Behind Weights Update13:16
Explore the mean squared error cost function and gradient descent updates to derive theta0 and theta1, guiding weights toward the global minimum for linear regression.
Multiple Linear Regression3:51
Explore multiple linear regression, extending simple regression to multiple features like bathrooms, parking, and rooms. See how h_theta(x) uses theta0 and feature coefficients, and how J measures error.
Performance metrics of Models9:53
Assess how well a regression model explains data using R-squared, which compares model error to the mean baseline, and use adjusted R-squared to account for non-correlated features.
Understand Different Cost Functions7:38
Understand MSE, MAE, and RMSE, their formulas, advantages like a single global minimum and differentiability, and MAE's robustness to outliers.
Overfitting and Underfitting in Machine Learning6:47
Explore the bias-variance tradeoff by comparing overfitting, underfitting, and generalized models through training, validation, and test data, while optimizing weights (theta) and cost via hyperparameter tuning.
What is Mode Validation8:21
Evaluate regression models by validating R2, MAE, and RMSE to gauge variance explained, average error, and outlier impact, guiding acceptance or refinement.
Simple Linear Regression in Scikit-learn8:33
Learn to build a simple linear regression model with scikit-learn using toy dataset of weight and height, visualize relationship with a scatter plot, and prepare x and y for training.
Train the Model using Scikit-learn9:16
Split the dataset into train and test sets using train_test_split, standardize features with StandardScaler, and train a linear regression model while guarding against data leakage.
Validate Model Performance6:49
Validate a trained linear regression model by computing R2, RMSE, and MAE, then compare training and testing R2 to assess generalization and potential underfitting.
Multiple Linear Regression Using Scikit-learn16:05
Learn multiple linear regression with scikit-learn on a housing price dataset, using features like square feet, bedrooms, and neighborhood. Master preprocessing, SGD regression, and evaluating with R2, RMSE, and MAE.
What is Polynomial Regression7:12
Discover polynomial regression for non-linear data, using degree-based equations to model one or multiple features, and understand overfitting as degree increases.
Polynomial Regression Using Scikit-learn4:24
Apply polynomial features with scikit-learn to fit nonlinear data, split into train and test sets, scale features, and compare r² scores across degrees 1, 2, and 8.
What is Ridge Regression11:49
Practice ridge regression, also known as l2 regularization, to reduce overfitting by adding a lambda term to the cost function and shrinking coefficients in linear and multiple linear regression.
What is Lasso Regression7:03
Explore how lasso regression (L1 regularization) performs feature selection by shrinking coefficients to zero, contrasting with ridge (L2) regularization, and learn when elastic net helps with highly correlated data.
Understand Cross Validation6:27
Understand cross validation for hyperparameter tuning, including k-fold cross validation and stratified k-fold, with training and validation splits and shuffled data in scikit-learn.
Ridge and Lasso Regression in Scikit-learn8:37
Practice ridge and lasso regression in scikit-learn, optimize alpha with grid search cross-validation, and evaluate using RMSE on a housing price dataset.
Mathematical Intuition Behind Logistic Regression9:43
Explore logistic regression for binary classification and how it differs from linear regression; see how outliers distort the decision boundary and 0.5 threshold for pass/fail from study hours.
Understand Logistic Regression Equation5:44
Learn how logistic regression turns a linear decision boundary into a sigmoid-based model for binary classification, using h_theta(x) = 1/(1+e^{-theta0 - theta1 x}) and a 0.5 threshold.
Cost Function for Logistic Regression7:54
Explore the logistic regression cost function, compare it to linear regression, and introduce a convex cross-entropy form with gradient descent for binary classification.
Performance Metrics for Logistic Regression12:54
Analyze confusion matrices for logistic regression, including true/false positives and negatives in binary classification. Use accuracy on balanced data, and precision, recall, and F1 or F-beta scores for imbalanced cases.
Understand Multi Level Classification4:48
Explore multi-level binary classification using four logistic regression models, each predicting one class such as spam, not spam, promotional, and ads, with the highest probability determining the final label.
Code Logistic Regression Using Scikit-learn6:52
Code logistic regression with scikit-learn by generating a synthetic dataset with make_classification, splitting into train and test, training the model, and evaluating with accuracy, confusion matrix, and classification report.
Linear Regression vs Logistic Regression9:31
Explain the contrast between linear regression for continuous outputs with linear feature–label relations, and logistic regression for classification, illustrated by housing prices and spam detection.
Support Vector Machine (SVM)7:41
Learn how a support vector machine classifies data by separating two classes with a maximal margin, identifying support vectors, and using soft and hard margin concepts (SVC and SVR).
SVM Math Intuition8:54
Explore the math intuition of SVMs in a 2d plane, deriving the boundary w^T x + c = 0 and margins w^T x + c = ±1 with distance 2/||w||.
SVM Cost Functions7:19
Explore hard and soft margin SVMs, including the decision boundary and margin planes where support vectors lie on the marginal planes, and learn the soft-margin hinge loss to handle misclassifications.
Understand SVM Kernels6:10
Understand svm kernels transform non-linear data into higher dimensions to become linearly separable, enabling the decision boundary and margins in classification; svm shines on non-linear, small, high-dimensional data.
Build SVM Model Using Scikit-learn9:52
Explore building a support vector machine with scikit-learn using synthetic two-feature data, train/test splits, and linear and rbf kernels, with evaluation via classification report and confusion matrix.
K-Nearest Neighbors (KNN)9:53
Discover the k-nearest neighbors algorithm, an instance-based lazy learner for classification and regression. Select k, measure distances, and average outputs for regression or majority-vote for classification.
Naive Bayes Algorithm8:06
Explore the Naive Bayes algorithm for classification, grounded in Bayes' theorem and conditional probability, illustrated with a ball-drawing example and the meaning of P(A|B) in machine learning.
Naive Bayes Algorithm Math Intuition8:26
Apply Naive Bayes to classify using features x1 through x4. Compute P(y|x1..x4) as P(y) times the product of P(xi|y) divided by the product of P(xi) to decide yes or no.
Apply Naive Bayes on Dataset8:14
Apply naive bayes to a cricket dataset, compute P(yes) and P(x|yes) for outlook, temperature, and humidity, and predict play cricket using scikit-learn.
Types of Naive Bayes Algorithm2:26
Master the types of Naive Bayes algorithms, including Bernoulli, Multinomial, and Gaussian, and learn when to apply each to binary features, text data, or Gaussian-distributed features.
Naive Bayes using Scikit-learn8:51
Learn to implement Gaussian Naive Bayes with scikit-learn on the iris dataset: load data, split into train and test, train the model, and evaluate with accuracy and a classification report.
Decision Tree Algorithm10:44
Explore how a decision tree uses categorical features like outlook and humidity to split data, identify pure and impure states, and determine cricket play using yes no probabilities.
Calculate Entropy in Decision Tree Algorithm11:32
Calculate entropy and guinea impurity to determine pure and impure states in a decision tree. The lecture explains the formulas, an example, and entropy versus guinea impurity comparisons.
Calculate Gain in Decision Tree Algorithm7:37
Explore information gain to select the best feature for the root node and splits in a decision tree, using entropy and gain impurity to reach leaf nodes.
Pre-Pruning and Post-Pruning Intuition3:39
Explore pre-pruning and post-pruning techniques to reduce overfitting in decision trees, adjusting hyperparameters and pruning branches to reach optimal leaf nodes.
Decision Tree Regressor14:36
the lecture explains applying a decision tree regressor to numerical data, detailing sorting features, evaluating splits by information gain and variance reduction, and selecting the optimal root node.
When to Use Decision Tree1:24
Apply a decision tree to reveal why a model makes a decision, handling classification and regression, non-linearity, and missing values, while not suited for long data or noisy data.
Decision Tree Algorithm Using Scikit-learn11:01
Build and visualize a decision tree on the iris data set using scikit-learn. Train the model, evaluate with confusion matrix and classification report, and tune hyperparameters via grid search.
Random Forest Algorithm8:34
Learn how the random forest algorithm uses bagging and boosting ensemble techniques to combine models such as decision trees, logistic regression, naive Bayes, and SVM for classification and regression tasks.
Random Forest Algorithm Intuition3:54
Explore how random forest uses bagging to train multiple decision trees in parallel on random data and feature samples, with hyperparameter tuning to select the number of models, reducing overfitting.
Introduction to AdaBoost Algorithm10:56
Learn how AdaBoost uses sequential one-depth decision trees as weak learners, weighted by alphas, to reduce bias and prevent underfitting and overfitting through gain-based stump selection.
Calculate Weights in AdaBoost Algorithm11:54
Compute the initial sample weights, evaluate the stump's performance, derive alpha, and update weights by boosting correctly classified points and upweighting misclassified ones in AdaBoost.
AdaBoost Algorithm Mathematical Intuition9:38
In AdaBoost, normalize the updated weights, assign bins, and use weighted data points to select samples for the next stump, iterating to reduce bias.
AdaBoost vs Random Forest Algorithm3:10
Compare AdaBoost and random forest as boosting versus bagging. AdaBoost uses sequential training to reduce bias on small data; random forest uses parallel training to reduce variance and resist noise.
Introduction to XGBoost Algorithm1:48
Introduce XGBoost as a performance library built on gradient boosting, clarify that XGBoost is a library, not an algorithm, and highlight its speed, L1 and L2 regularization, and distributed training.
Different Features of XGBoost Algorithm11:02
XGBoost enables parallel processing, optimized structures, and distributed and out-of-core computing, while handling missing values and using regularization with quantile sketch for fast splits.
How Gradient Boosting Works8:53
Explore how gradient boosting and the XGBoost regressor tackle regression by starting from the mean of the package, then reduce residuals with learning rate and decision trees.
XGBoost on Regression Problem27:18
Learn how XGBoost differs from gradient boosting in regression by iteratively reducing residuals with decision trees, using a mean base model and regularization to improve predictions.
XGBoost on Classification Problem24:52
Explore how the XGBoost classifier predicts whether a student is placed from CGPA by starting with a base model using log odds, then building staged trees to reduce residuals.
Build Model with XGBoost and Scikit-learn8:11
Install XGBoost, load the breast cancer dataset, split data, train an XGBoost classifier with 100 estimators and 4 max_depth, and evaluate with accuracy and a classification report.
When to Use Which Algorithms14:54
Explore when to use nine machine learning algorithms, from linear and logistic regression to XGBoost. Apply enterprise use cases like retail forecasting, credit risk scoring, and fraud detection in banking.
Quiz

Requirements

No prior knowledge of AI or machine learning is required. Everything is taught from the ground up
A laptop or desktop with internet connection to run Python, VS Code, and required tools
An OpenAI API key will be required for some LLM and AI application examples (usage may incur small costs)
Azure Databricks will be used in the data engineering section, and creating/using it may involve cloud usage charges

Description

Welcome to the step by step roadmap to becoming an AI engineer

This course is designed to take you from the very basics to advanced real world AI engineering with a clear learning path. You will not just learn concepts. You will build systems, understand why they work, and learn how different pieces of AI, data, machine learning, deep learning, and LLM engineering connect together in real projects.

Most courses teach tools in isolation. This course teaches you how to connect everything into one complete AI engineering workflow that is used in industry.

In this course you will start from the foundations and gradually move towards advanced AI systems used today.

You will learn

• Python programming from beginner to advanced concepts for AI work

• How data is represented using vectors and matrices

• Data visualization, statistics, and feature engineering techniques

• SQL and PostgreSQL which every AI engineer must know

• Supervised machine learning algorithms like regression, SVM, decision tree, and XGBoost using scikit learn

• Unsupervised learning algorithms like K means and DBSCAN with proper visualization

• How to build APIs using FastAPI and run AI applications inside Docker containers

• Data engineering fundamentals with Kafka and Spark architecture and why data engineering is critical in AI projects

• Build scalable AI engineering pipelines using Azure Databricks, PySpark, Delta Lake, and end to end data workflows

• Deep learning foundations and the mathematics behind neural networks

• CNN and how machines process and understand images

• NLP fundamentals like tokenization, embeddings, and word embeddings

• Sequence models including RNN, LSTM, and GRU

• PyTorch from scratch and building ANN, CNN, and RNN models using PyTorch

• Transformer architecture explained step by step in simple terms

• LLM engineering, RAG, advanced RAG architecture, LangChain, LangGraph, AI agents, and LLM workflows

• Master MLflow for AI engineering, including experiment tracking and prompt experimentation and evaluation

• MCP and how to build a local MCP server and client

• Practical Guide to Running Open Source LLMs Locally using Ollama

This is a complete theory plus hands on course. You will write code, build projects, deploy models, and understand the logic behind every step instead of copying and pasting code.

If you want to transition your career into AI, understand how modern AI systems are built, and become job ready as an AI engineer, this course is for you.

Join this course and start your journey to become a full stack AI engineer.

Who this course is for:

Beginners who want a clear and structured roadmap to enter the field of AI and machine learning from scratch
Software developers and full stack developers who want to transition into AI engineering and learn how real AI systems are built and deployed
Data analysts, data engineers, and aspiring data scientists who want to strengthen their practical AI and deep learning skills
Students and fresh graduates who want to become job ready AI engineers

What you'll learn

Explore related topics

Coding Exercises

Course content

Introduction to AI Engineering4 lectures • 19min

Complete Course Notes – Your AI Engineering Playbook1 lecture • 1min

Python Fundamentals For AI Engineering23 lectures • 4hr 49min

Math For Machine Learning10 lectures • 57min

Statistics for Data Science17 lectures • 2hr 24min

Data Visualization11 lectures • 52min

Feature Engineering for Machine Learning12 lectures • 1hr 43min

Exploratory Data Analysis (EDA)12 lectures • 1hr 44min

SQL for AI Engineers24 lectures • 2hr 41min

Supervised Machine Learning with Scikit-learn59 lectures • 8hr 49min

Requirements

Description

Who this course is for: