Teach on Udemy

Turn what you know into an opportunity and reach millions around the world.

Learn More

Your cart is empty.

Keep shopping

PyTorch: Deep Learning and Artificial Intelligence

Name: PyTorch: Deep Learning and Artificial Intelligence
Rating: 4.7 (2650 reviews)

Neural Networks for Computer Vision, Time Series Forecasting, NLP, GANs, Reinforcement Learning, and More!

Created byLazy Programmer Team, Lazy Programmer Inc.

Last updated 3/2026

English

English [Auto],

What you'll learn

Artificial Neural Networks (ANNs) / Deep Neural Networks (DNNs)
Predict Stock Returns
Time Series Forecasting
Computer Vision
How to build a Deep Reinforcement Learning Stock Trading Bot
GANs (Generative Adversarial Networks)
Recommender Systems
Image Recognition
Convolutional Neural Networks (CNNs)
Recurrent Neural Networks (RNNs)
Natural Language Processing (NLP) with Deep Learning
Demonstrate Moore's Law using Code
Transfer Learning to create state-of-the-art image classifiers
Understand important foundations for OpenAI ChatGPT, GPT-4, DALL-E, Midjourney, and Stable Diffusion

Course content

23 sections • 151 lectures • 24h 24m total length

Welcome4:03
Explore neural networks from neurons to convolutional and recurrent architectures, learn GANs, reinforcement learning, NLP, and applications like stock prediction, speech recognition, and self-driving tech using PyTorch.
Overview and Outline13:14
this PyTorch course outlines the curriculum, covering fundamental architectures and time series forecasting, then explores applications like natural language processing, recommender systems, and reinforcement learning using Google Colab.

Where to get the code, notebooks, and data4:38
Discover where to access the code, notebooks, and data for PyTorch by using the resources tab, code link, and GitHub repo, with notes on notebooks versus plain text Python files.
How to Succeed in This Course3:04
Ask questions via the Q&A to accelerate learning, as answers typically arrive within 24 hours; meet prerequisites and stay engaged by coding or taking handwritten notes.
Temporary 403 Errors2:57
Handle 403 errors by downloading the file in a browser and uploading it via Colab's file explorer, then continue using the file in your notebook.

Intro to Google Colab, how to use a GPU or TPU for free12:33
Discover Google Colab as a cloud-based Python notebook that lets you use free GPU or TPU, with preinstalled libraries and easy Google Drive sharing.
Uploading your own data to Google Colab13:12
Learn how to upload your own data to Google Colab via wget, file uploads, and drive mounting; load with pandas, and create histograms and scatter plots.
Where can I learn about Numpy, Scipy, Matplotlib, Pandas, and Scikit-Learn?11:24
Learn numpy, pandas, and matplotlib basics for deep learning, with scipy and scikit-learn concepts, PyTorch workflows, and data shapes X and Y, training, and evaluation.

What is Machine Learning?14:26
Explore how machine learning is essentially a geometry problem, using supervised learning to fit lines or curves for regression and to separate categories for classification with data points and features.
Regression Basics14:39
Explore regression basics by fitting a line y hat equals M X plus B to X and y, using mean squared error and gradient descent to find M and B.
Regression Code Preparation11:45
Explore how to prepare code for PyTorch linear regression, build the model, define loss and optimizer, perform gradient steps, and handle tensors and data types.
Regression Notebook13:14
Learn linear regression with a PyTorch model to find the line of best fit from synthetic data. Train with MSE loss and an optimizer, and plot losses to gauge convergence.
Moore's Law6:57
Apply Moore's law to real world data by modeling the exponential growth of transistor counts with a log transformation and linear regression, and recognize data normalization caveats.
Moore's Law Notebook13:51
Explore a PyTorch linear regression notebook that validates Moore's Law by modeling transistor growth, using CSV data, normalization, a log transform, and a line of best fit.
Linear Classification Basics15:06
Explore linear classification concepts, including how a line separates two classes using a sigmoid activation and binary cross entropy loss, and implement training steps in PyTorch.
Classification Code Preparation6:56
Load breast cancer data from the cycle learn API, normalize with a standard scaler, split into train and test sets, train a linear classifier with a sigmoid, and evaluate accuracy.
Classification Notebook12:00
Train a binary classifier for breast cancer with PyTorch, using a train/test split, standardized features, a linear layer with sigmoid, BCE loss, and Adam, monitoring train and test accuracy.
Saving and Loading a Model5:21
Save a PyTorch model by saving its state dict to a file, then load it later for evaluation, verify performance, and download or share the file from Colab.
A Short Neuroscience Primer9:51
Explore how linear regression and logistic regression underlie neural computation, linking weights, bias, and input features to classification and the all-or-nothing neuron model via sigmoid activation.
How does a model "learn"?10:50
Explore how models learn from data using linear regression, mean squared error, gradient descent, and learning rate with PyTorch automatic differentiation.
Model With Logits4:18
Explore a numerically stable alternative for logistic regression by training a linear model with BCE with logits loss, using the logit output directly instead of a sigmoid.
Train Sets vs. Validation Sets vs. Test Sets10:12
Split data into train, validation, and test sets to manage generalization and the bias-variance tradeoff. Use cross-validation and validation scores to select models and assess performance on unseen data.
Suggestion Box3:10
Submit feedback via the suggestion box to improve the PyTorch: deep learning and artificial intelligence course, sharing your background, course you’re taking, difficulty, and missing explanations or future topics.

Artificial Neural Networks Section Introduction6:00
Introduce feed-forward neural networks, a basic neuron-inspired model. Explore architecture, activation functions, and multi-class classification for processing images and other unstructured data.
Forward Propagation9:40
Explore forward propagation in neural networks, from single neurons to deep, wide layers, using the sigmoid and matrix notation to learn hierarchical features for classification and regression.
The Geometrical Picture9:43
Explore how neural networks transform geometry into nonlinear decision boundaries by learning nonlinear features automatically, via sigmoid-activated hidden layers, without manual feature engineering, using tensor flow playground.
Activation Functions17:18
Explore sigmoid and tanh activation functions, their vanishing gradient and zero-centered issues, and evaluate relu and variants like leaky relu, elu, and softplus.
Multiclass Classification9:39
Master multiclass classification in PyTorch using softmax to map activations to category probabilities. Compare cross-entropy loss, binary vs multiclass tasks, and k output nodes for effective deep learning models.
How to Represent Images12:21
Learn how images are stored as height, width, and rgb color channels in 8-bit or 0–1 float values, and how to flatten to vectors or form 4d tensors for networks.
Color Mixing Clarification0:54
Learn how pigment color mixing differs from digital RGB, noting that mixing red, blue, and yellow often yields brownish colors, while RGB represents images on computers.
Code Preparation (ANN)14:57
Load the amnesty handwritten digits dataset from the Torture Vision library, flatten 28x28 grayscale images to 784 features, and train a 784-128-10 ann for multiclass classification with batch gradient descent.
ANN for Image Classification18:28
Train a neural network for image classification in PyTorch using torchvision transforms and data loaders. Run on GPU when available and evaluate with cross-entropy loss, accuracy, and a confusion matrix.
ANN for Regression10:55
this lecture shows a two-input neural network for regression, with one hidden layer of 128 units, using mse loss on synthetic data to visualize the cosine surface in three-dimensional plots.
How to Choose Hyperparameters6:16
Choose hyperparameters through experimentation, not rules, using random searches, performance plots, and paper baselines for starting points, with learning rate and other settings explored on a log scale.

What is Convolution? (part 1)16:38
Learn how convolution transforms images in neural networks, using input, filter (kernel), and output, with examples like blur and edge detection, and understand padding and valid/same/full modes.
What is Convolution? (part 2)5:56
Explore how convolution acts as a sliding pattern finder, using dot products and cosine similarity to detect patterns in images through cross-correlation.
What is Convolution? (part 3)6:41
Explore the equivalence of convolution and matrix multiplication, reveal weight sharing to reduce parameters, and explain translational invariance for image feature detection.
Convolution on Color Images15:58
Extend convolution to color images using three-dimensional filters that scan height, width, and color channels to produce stacked feature maps with biases and activations.
CNN Architecture20:53
Explore the cnn architecture, from convolution and pooling to dense layers, and learn how hierarchical feature maps, stride, and global pooling shape image processing in pytorch.
CNN Code Preparation (part 1)17:42
Explore CNN code preparation in PyTorch, building models with convolution modules, flattening, and sequential versus custom architectures. Learn about dropout, train/eval modes, and evolving model design beyond simple stacks.
CNN Code Preparation (part 2)8:00
Compute the dimensionality of the initial feature vector into dense layers using convolution arithmetic with padding and strides, noting PyTorch uses explicit padding rather than same mode.
CNN Code Preparation (part 3)5:40
Learn to prepare convolutional neural network code by loading fashion amnesty and CFR 10 datasets, building and training the model, evaluating with accuracy and confusion matrices, and handling class encoding.
CNN for Fashion MNIST11:32
Explore a PyTorch cnn approach for fashion mnist in a Colab notebook, building convolutional and dense layers with activation and dropout, and evaluating train and test accuracy.
CNN for CIFAR-108:05
Explore CNN-based image classification on CIFAR-10 in Colab notebook, using color 32 by 32 images, data loaders, and functional forward pass; evaluate with accuracy, a confusion matrix, and misclassified samples.
Data Augmentation9:45
apply on-the-fly data augmentation with torch vision transforms to improve model generalization, using randomized rotations, flips, and color changes within the training data loader.
Batch Normalization5:14
Learn how batch normalization normalizes activations per batch by subtracting the batch mean and dividing by the batch standard deviation, then re-scaling with learned gamma and beta.
Improving CIFAR-10 Results10:46
Improve cifar-10 results by applying data augmentation and batch normalization, tune hyperparameters in a colab notebook, and experiment with convolutional architectures inspired by VG networks using padding and pooling.

Sequence Data22:14
Master sequence data in deep learning, shaped as n by t by d, with examples like stock prices and weather. Use padding and batch processing to manage unequal sequence lengths.
Forecasting10:58
Learn how to forecast time series correctly, predicting multiple steps ahead using past values and a linear regression baseline, then extend to autoregressive approaches and iterative multi-step forecasts in PyTorch.
Autoregressive Linear Model for Time Series Prediction12:15
Explore autoregressive linear models for time series prediction by building a synthetic sine wave dataset with 10-step inputs and comparing incorrect and correct forecasting in PyTorch.
Proof that the Linear Model Works4:12
Demonstrates how a linear recurrence model (R2) can perfectly predict a sine wave using only two past values, without a bias, through autoregressive derivation with trig identities.
Recurrent Neural Networks21:31
Explore recurrent neural networks and how they model sequences with a hidden state across time steps, using shared weights to process multi-dimensional time series and predict outputs.
RNN Code Preparation13:49
Develop a simple PyTorch RNN for time-series forecasting, handling end-to-end steps from data loading to evaluation and prediction, including multi-layer RNNs, hidden state initialization, and shape management.
RNN for Time Series Prediction9:29
Learn how a recurrent neural network approaches time series prediction, compare with an auto regressive model, and analyze single-step and multi-step forecasts with noisy data.
Paying Attention to Shapes9:33
Learn to track shapes in art using a PyTorch rnn, examining sequence length t, input dimensionality d, hidden units m, and outputs k through the forward pass.
GRU and LSTM (pt 1)17:35
Learn why vanishing gradients necessitated recurrent units like LSTM and GRU, and how update and reset gates preserve long-term memory through a convex combination.
GRU and LSTM (pt 2)11:45
Explore GRU and LSTM (pt 2) by detailing forget, input, and output gates, cell state dynamics, and how these gates enable long-term memory, with experiments showing LSTM often outperforms GRU.
A More Challenging Sequence10:28
Compare autoregressive linear models with neural networks on a non-linear time series, showing how a neural approach better captures changing frequency and supports one-step and multi-step forecasts.
RNN for Image Classification (Theory)4:41
Apply an rnn to image classification by treating images as a multi-dimensional time series, scanning rows top to bottom, with a final dense layer of 10 outputs and softmax activation.
RNN for Image Classification (Code)2:48
Train a rnn for image classification on the M9 dataset using a prepared code lab notebook, loading 28 by 28 images, and achieve 99 percent accuracy after ten epochs.
Stock Return Predictions using LSTMs (pt 1)12:24
Learn to predict stock returns with LSTMs using real Starbucks data, explore time-series windows and standardization, and critically assess multi-step forecasts and common pitfalls.
Stock Return Predictions using LSTMs (pt 2)6:16
Learn why predicting stock returns matters, define return as (final price minus initial price) over initial, and build an autoregressive LSTM on normalized returns with one-step and multi-step forecasts.
Stock Return Predictions using LSTMs (pt 3)11:46
Develop an lstm-based stock predictor using open, high, low, close, and volume data, cast as a binary up or down classification with binary cross-entropy, exploring train/test splits and overfitting insights.
Other Ways to Forecast5:14
Explore alternative multi-step forecasts, compare one-step and iterative methods, and use baselines like the naive forecast and random walk to evaluate multi-output models forecasting twelve steps ahead.

Embeddings13:12
This lecture explains why one-hot encoding is impractical for words and shows how embedding layers map word indices to dense vectors for sequence processing by recurrent neurons.
Neural Networks with Embeddings3:45
Build a PyTorch text model with an embedding layer mapping word indices to vectors, then process via the embedding, alice module, and final linear layer to yield K outputs.
Text Preprocessing Concepts13:33
Learn how to convert text to numbers for embedding layers by tokenization, word-to-index mapping, and padding strategies (pre or post) for fixed-length sequences.
Beginner Blues - PyTorch NLP Version10:36
Learn practical text preprocessing in PyTorch NLP by tokenizing, building word-to-index mappings with padding, converting documents to integer inputs, and coding from scratch to withstand library API changes.
(Legacy) Text Preprocessing Code Preparation11:53
Learn how to preprocess text for classification in torch text by converting documents into integer sequences, building vocab, and storing data in a CSB with input and label fields.
(Legacy) Text Preprocessing Code Example7:53
Learn text preprocessing in a notebook by building a CSV data frame, tokenizing with spaCy, forming a vocab, and applying padding for train and test data.
Text Classification with LSTMs (V2)17:42
Explore text classification with LSTMs in a CoLab notebook, focusing on spam detection, tokenization, word to index mapping, padding, embedding, and training to achieve high accuracy.
CNNs for Text12:07
This lecture explains how one-dimensional convolutional neural networks process text, using embeddings and global max pooling to extract features, followed by dense layers for binary classification.
Text Classification with CNNs (V2)7:16
Examine a one-dimensional CNN for text classification, detailing embedding, preprocessing, and convolutional pooling, and show high accuracy on a spambots csfi dataset.
(Legacy) VIP: Making Predictions with a Trained NLP Model7:37
Extend a trained NLP model to predict by preprocessing text, tokenizing, mapping to integers, and using the predict function, checking imbalanced classes with a confusion matrix and two prediction methods.
VIP: Making Predictions with a Trained NLP Model (V2)4:21
Learn to make predictions with a trained NLP model by tokenizing text, converting to a list of integers, creating a torch tensor, and predicting whether the input is spam.

Recommender Systems with Deep Learning Theory10:26
Learn how deep learning builds recommender systems from user–item–rating triples with incomplete data, using embeddings to represent users and items, then a neural network for rating prediction, matrix factorization parallels.
Recommender Systems with Deep Learning Code Preparation9:38
Map users and movies to embeddings, combine them into a joint feature matrix, and train a two-input neural network for rating prediction using a data loader pipeline.
Recommender Systems with Deep Learning Code (pt 1)8:52
Build a deep learning recommender system using the movie lens dataset, with user and movie embeddings and a two-layer neural network. Prepare data, standardize ratings, and monitor RMSE via profiling.
Recommender Systems with Deep Learning Code (pt 2)12:31
Explore a modified recommender systems code in PyTorch, compare with TensorFlow, diagnose embedding-layer weight initialization, and implement a custom NumPy training loop for faster, improved mean squared error.
VIP: Making Predictions with a Trained Recommender Model4:51
Use a trained recommender model to predict top unseen movies for a user. Prepare inputs as tensors, compute predictions, sort in descending order, and select the top 10 with scores.

Transfer Learning Theory8:12
Use transfer learning by freezing the pretrained cnn body as a feature extractor and training a new head, enabling fast training with limited data.
Some Pre-trained Models (VGG, ResNet, Inception, MobileNet)4:05
Explore transfer learning with CNN architectures including veggie (VGG) variants, resonant networks with residual branches, Inception with parallel convolutions and multiple filter sizes, and mobile-friendly networks for lightweight devices.
Large Datasets7:11
Learn to manage large image datasets by loading data from disk with batch gradient descent, using TorchVision's image folder with train and validation class folders.
2 Approaches to Transfer Learning4:51
Explores two transfer learning approaches. One trains the full cnn with data augmentation inside the loop; the other pre compute feature vectors z and trains a logistic regression.
Transfer Learning Code (pt 1)9:36
Practice transfer learning with data augmentation in PyTorch, using a pretrained torchvision model, freezing weights, and replacing the classifier with a new linear head to classify food vs non-food images.
Transfer Learning Code (pt 2)7:40
Explore transfer learning without data augmentation using a prepared notebook, precomputing features, and training a fast logistic regression classifier with about 200x speedup and comparable accuracy.

Requirements

Know how to code in Python and Numpy
For the theoretical parts (optional), understand derivatives and probability

Description

Ever wondered how AI technologies like OpenAI ChatGPT, GPT-4, DALL-E, Midjourney, and Stable Diffusion really work? In this course, you will learn the foundations of these groundbreaking applications.

Welcome to PyTorch: Deep Learning and Artificial Intelligence!

Although Google's Deep Learning library Tensorflow has gained massive popularity over the past few years, PyTorch has been the library of choice for professionals and researchers around the globe for deep learning and artificial intelligence.

Is it possible that Tensorflow is popular only because Google is popular and used effective marketing?

Why did Tensorflow change so significantly between version 1 and version 2? Was there something deeply flawed with it, and are there still potential problems?

It is less well-known that PyTorch is backed by another Internet giant, Facebook (specifically, the Facebook AI Research Lab - FAIR). So if you want a popular deep learning library backed by billion dollar companies and lots of community support, you can't go wrong with PyTorch. And maybe it's a bonus that the library won't completely ruin all your old code when it advances to the next version. ;)

On the flip side, it is very well-known that all the top AI shops (ex. OpenAI, Apple, and JPMorgan Chase) use PyTorch. OpenAI just recently switched to PyTorch in 2020, a strong sign that PyTorch is picking up steam.

If you are a professional, you will quickly recognize that building and testing new ideas is extremely easy with PyTorch, while it can be pretty hard in other libraries that try to do everything for you. Oh, and it's faster.

Deep Learning has been responsible for some amazing achievements recently, such as:

Generating beautiful, photo-realistic images of people and things that never existed (GANs)
Beating world champions in the strategy game Go, and complex video games like CS:GO and Dota 2 (Deep Reinforcement Learning)
Self-driving cars (Computer Vision)
Speech recognition (e.g. Siri) and machine translation (Natural Language Processing)
Even creating videos of people doing and saying things they never did (DeepFakes - a potentially nefarious application of deep learning)

This course is for beginner-level students all the way up to expert-level students. How can this be?

If you've just taken my free Numpy prerequisite, then you know everything you need to jump right in. We will start with some very basic machine learning models and advance to state of the art concepts.

Along the way, you will learn about all of the major deep learning architectures, such as Deep Neural Networks, Convolutional Neural Networks (image processing), and Recurrent Neural Networks (sequence data).

Current projects include:

Natural Language Processing (NLP)
Recommender Systems
Transfer Learning for Computer Vision
Generative Adversarial Networks (GANs)
Deep Reinforcement Learning Stock Trading Bot

Even if you've taken all of my previous courses already, you will still learn about how to convert your previous code so that it uses PyTorch, and there are all-new and never-before-seen projects in this course such as time series forecasting and how to do stock predictions.

This course is designed for students who want to learn fast, but there are also "in-depth" sections in case you want to dig a little deeper into the theory (like what is a loss function, and what are the different types of gradient descent approaches).

I'm taking the approach that even if you are not 100% comfortable with the mathematical concepts, you can still do this! In this course, we focus more on the PyTorch library, rather than deriving any mathematical equations. I have tons of courses for that already, so there is no need to repeat that here.

Instructor's Note: This course focuses on breadth rather than depth, with less theory in favor of building more cool stuff. If you are looking for a more theory-dense course, this is not it. Generally, for each of these topics (recommender systems, natural language processing, reinforcement learning, computer vision, GANs, etc.) I already have courses singularly focused on those topics.

Thanks for reading, and I’ll see you in class!

WHAT ORDER SHOULD I TAKE YOUR COURSES IN?:

Check out the lecture "Machine Learning and AI Prerequisite Roadmap" (available in the FAQ of any of my courses, including the free Numpy course)

UNIQUE FEATURES

Every line of code explained in detail - email me any time if you disagree
No wasted time "typing" on the keyboard like other courses - let's be honest, nobody can really write code worth learning about in just 20 minutes from scratch
Not afraid of university-level math - get important details about algorithms that other courses leave out

Who this course is for:

Beginners to advanced students who want to learn about deep learning and AI in PyTorch

PyTorch: Deep Learning and Artificial Intelligence

What you'll learn

Explore related topics

Course content

Introduction2 lectures • 17min

Getting Set Up3 lectures • 11min

Google Colab3 lectures • 37min

Machine Learning and Neurons15 lectures • 2hr 33min

Feedforward Artificial Neural Networks11 lectures • 1hr 56min

Convolutional Neural Networks13 lectures • 2hr 23min

Recurrent Neural Networks, Time Series, and Sequence Data17 lectures • 3hr 7min

Natural Language Processing (NLP)11 lectures • 1hr 50min

Recommender Systems5 lectures • 46min

Transfer Learning for Computer Vision6 lectures • 42min

Requirements

Description

Who this course is for: