Udemy Business

Teach on Udemy

Turn what you know into an opportunity and reach millions around the world.

Learn More

Your cart is empty.

Keep shopping

Supervised Learning for AI with Python and Tensorflow 2

Name: Supervised Learning for AI with Python and Tensorflow 2
Rating: 4.7 (39 reviews)

Uncover the Concepts and Techniques to Build and Train your own Artificial Intelligence Models

Created byJeremy Richard Lai Hong

Last updated 11/2021

English

What you'll learn

The basics of supervised learning: What are parameters, What is a bias node, Why do we use a learning rate
Techniques for dealing with data: How to Split Datasets, One-hot Encoding, Handling Missing Values
Vectors, matrices and creating faster code using Vectorization
Mathematical concepts such as Optimization, Derivatives and Gradient Descent
Gain a deep understanding behind the fundamentals of Feedforward, Convolutional and Recurrent Neural Networks
Build Feedforward, Convolutional and Recurrent Neural Networks using only the fundamentals
How to use Tensorflow 2.0 and Keras to build models, create TFRecords and save and load models
Practical project: Style Transfer - Use AI to draw an image in the style of your favorite artist
Practical project: Object Detection - Use AI to Detect the bounding box locations of objects inside of images
Practical project: Transfer Learning - Learn to leverage large pretrained AI models to work on new datasets
Practical project: One-Shot Learning - Learn to build AI models to perform tasks such as Face recognition
Practical project: Text Generation - Build an AI model to generate text similar to Romeo and Juliet
Practical project: Sentiment Classification - Build an AI model to determine whether text is overall negative or positive
Practical project: Attention Model - Build an attention model to build an interpretable AI model

Course content

6 sections • 113 lectures • 21h 12m total length

Introduction5:56
Explore supervised learning by mapping inputs to outputs with neural networks in Python using TensorFlow 2, including image and text tasks, object detection, sentiment analysis, in Google Colab.
Syllabus4:05
Master supervised learning foundations, build neural networks from scratch, and apply gradient descent, perceptron, convolutional neural networks and recurrent neural networks to image and text tasks.
Setup Coding Environment Resources0:14
Setup Coding Environment12:35
Set up coding environments for supervised learning using Google Colab, Anakonda on your local machine, or a GPU-enabled setup. Download the code from the resources page and follow setup tips.

Artificial Intelligence Machine Learning Supervised Learning8:47
Explore how artificial intelligence, machine learning, and supervised learning map inputs to outputs using labeled data, with examples like English to French translation, and contrast unsupervised and reinforcement learning.
Parameters and threshold function14:59
Explore supervised learning with parametric models that fix the number of parameters, and learn to initialize and iteratively update weights from training data, using a threshold function to produce outputs.
Simple parametric model lab10:29
Train a simple parametric supervised learning model by formatting training data, fitting weights, and iterating to monitor progress toward convergence.
Model Intuition and Lab6:30
Explore a binary linear classifier by visualizing y_hat = x1 w1 + x2 w2 with weights set to 1 and the boundary x1 + x2 = 0.5, then plot points.
Learning rate and code clean up13:40
Explore how learning rate affects training, causing updates to weights through iterations, and learn to generalize code by using a weights list and loop-based updates.
A gentle introduction to vectors6:04
Explore vectors as generalizations of scalars, cover element-wise operations and dot product, and learn vector notation, transposition, and element indexing to speed up code via vectorization.
Vectorization Lab13:20
Vectorize the training method by replacing loops with the dot product between X and the weights, then benchmark with random data to show five to six times faster performance.
What is a Bias Node6:44
discover how a bias node shifts the decision boundary to solve problems that previously couldn't be solved, by adding an extra input and updating weights and bias.
Bias Node and Dynamic Decision Boundary Lab10:01
Explore how adding a bias node shifts the decision boundary by adjusting weights and biases, through a practical lab that updates, plots, and analyzes dynamic boundaries.
The Perceptron Algorithm and Lab17:18
Explore the perceptron algorithm from theory to hands-on lab, learning how to initialize weights and bias, apply a threshold function, adjust with learning rate, and assess generalization with datasets.
Non-Binary Inputs and Feature Scaling14:17
Explore non-binary inputs and feature scaling for supervised learning. A simple perceptron trained on house data uses max magnitude scaling to reduce training iterations and improve generalization.
Working with Real Data11:45
Learn to move from toy problems to real data by splitting datasets into training, validation, and testing sets, preventing overfitting, and handling missing values.
Working with Real Data Lab Part 130:52
Download real Titanic data from Kaggle, load it with pandas data frames, perform one-hot encoding, impute missing values, and prepare training and validation features for model fitting.
Working with Real Data Lab Part 212:37
Train a perceptron on scaled Titanic data after imputing missing ages, evaluate training and validation accuracy, and generate Kaggle submissions predicting passenger survival.
Saving and Loading Weights6:47
Learn how to save and load model weights with a simple perceptron using pickle, enabling training reuse and live predictions on large datasets.
Training Improvements9:27
Learn training improvements for the perceptron model, including tracking best validation accuracy, saving weights, and data ordering to boost Titanic testing accuracy and Kaggle predictions.
Classification vs Regression9:21
Explore supervised learning by adapting a perceptron from classification to linear regression, adjusting activation and evaluation with mean absolute difference, and intuitively fitting a line to data.
019 - Limitations of Perceptrons3:02
Demonstrates why a perceptron, a linear classifier, fails on non-linearly separable data like xor, and previews neural networks while covering basics of learning, weights, threshold, and data preparation.

Introduction to Neural Networks4:40
Explore the fundamentals of neural networks and deep learning, including perceptrons, weights, biases, and activation functions. See how hidden layers and nonlinear mappings power models like convolutional networks and transformers.
Logistic Regression Overview8:43
Explore logistic regression as a linear binary classifier, using the sigmoid activation to interpret outputs as probabilities and learning parameters via gradient descent with binary cross-entropy loss.
A Gentle Introduction to Derivatives15:53
Explore derivatives as rates of change, learn to compute dy/dx and limits, and apply five rules—power, constant multiplier, sum and difference, zero, and chain rule—to minimize the error function.
Gradient Descent6:28
Learn gradient descent, an optimization algorithm that minimizes the error function by updating weights and biases in the direction of steepest descent using a learning rate.
Logistic Regression Equations8:53
Explore logistic regression equations, compute model outputs with z = w^T x + b, apply the sigmoid activation, and update weights using gradient descent with binary cross-entropy loss.
Logistic Regression Lab8:03
Transform the perceptron into a logistic regression model using the sigmoid activation, train on the Titanic data, and boost Kaggle predictions.
Introduction to Matrices6:28
Learn matrices as two-dimensional number arrays, explore their shape and transpose, and master scalar and matrix arithmetic, elementwise operations, and dot-product based matrix multiplication.
Further Vectorization for Logistic Regression Lab10:56
Demonstrates vectorizing logistic regression with matrix multiplication, improving speed threefold while preserving accuracy, and setting the stage for neural networks.
Notation for Neural Networks7:19
Compare logistic regression to neural networks by introducing hidden layers for nonlinear classification, then master forward and backpropagation and notation for weights, biases, and layers.
Forward Propagation7:29
Apply forward propagation in a neural network by computing z values, applying sigmoid activations, and using matrix-vector notation with weights and biases to derive layer-wise outputs.
Forward Propagation Lab15:17
Implement forward propagation in Python by initializing layer sizes, weights, and biases with small random values, applying sigmoid activation, and producing predictions.
Backpropagation10:21
Explain how to update neural network parameters with backpropagation, using gradient descent and the chain rule to compute layer-by-layer derivatives for binary cross entropy loss.
Back Propagation Equation Derivations12:27
Derives backpropagation equations and their vectorized form for a two-layer neural network, showing gradient descent updates for weights and biases, and generalizes to networks with any number of layers.
Backpropagation Lab20:22
Implement and train a neural network by applying back propagation, including the loss function, sigmoid derivative, and gradient descent, with forward and back propagation and parameter initialization.
Understanding Hidden Layers10:39
See how hidden layers enable neural networks to learn via backpropagation, using nonlinear activation like the sigmoid to remap inputs into hidden features. Layers stack to learn complex mappings.
Weight Intialization4:23
Explain why zero initialization causes symmetry and prevents learning in neural networks, and show how random, small weight initialization enables updates in hidden layers, while biases need not be randomized.
Multi-Task and Multi-Class Classification9:22
Explores multitask and multiclass classification, showing how to adapt a neural network to predict multiple diseases with a vector output, using softmax and categorical cross-entropy loss.
Derivatives of Softmax and Categorical Cross Entropy9:05
Explore multiclass classification by applying softmax in the last layer and using categorical cross-entropy loss, then derive the gradient of the loss with respect to the last layer z values.
Multi-Class Classification Lab22:57
Explore multiclass classification with handwritten digits by implementing a neural network using categorical entropy and softmax, then compare training, validation, and testing accuracy with various hidden layers.
The Vanishing Gradient Problem and ReLu Activation Function5:30
Explain how the vanishing gradient problem slows learning in deep networks due to the sigmoid derivative shrinking weight updates. Introduce ReLU as a practical non-linear activation to keep updates effective.
Relu Lab10:23
Replace the sigmoid with the relu activation to address vanishing gradient and compare no-hidden, one-hidden, and large networks, noting training gains and overfitting risks.
Confusion Matrix Analysis16:13
Learn to analyze supervised learning models using confusion matrix analysis with a multiclass digit classifier, measure accuracy, interpret misclassifications, and visualize results to identify overfitting.
Overfitting7:15
Understand how larger neural networks can overfit by memorizing training data. Learn prevention techniques such as smaller models, more data, early stopping, and L1/L2 regularization plus dropout.
Batching Theory3:48
Explore batch and mini-batch gradient descent, showing how dataset size and batch size affect updates, convergence, and training stability versus stochastic gradient descent.
Batching Lab13:26
Implement 128-size batching to track training accuracy per batch and average it for iteration accuracy; compare batching with full-batch, include data shuffling and stochastic gradient descent with batch size one.
Code Cleanup9:45
Learn how to refactor supervised learning code by extracting parameter updates into a dedicated optimizer class, implementing gradient descent, removing learning rate, and supporting multiple optimizers for clearer, extensible training.
Optimizers - Momentum4:29
Explore momentum as gradient descent with a weighted average of past gradients, reducing local minima risks and smoothing updates to improve neural network training and performance.
Optimizers - Momentum Lab6:32
Implement momentum in gradient descent using moving averages to update parameters with a beta. Momentum affects testing accuracy, helping small models but hindering large ones, and varies by optimizer.
Optimizers - RMS prop3:29
RMSProp uses the moving average of squared gradients to divide the derivative by the square root. This scales learning rate, speeds up flat regions, slows steep regions, with an epsilon.
RMSprop Lab10:29
Implement the RMSprop optimizer and compare it with gradient descent with momentum, updating parameters via the squared gradient running average to stabilize softmax.
Optimizers - Adam3:27
Adam optimizer, a blend of momentum and RMSProp, uses bias-corrected terms to yield accurate weighted averages during early updates, with alpha, beta1, beta2, and epsilon.
Optimizers - Adam Lab8:08
Implement the Adam optimizer in the code lab, initialize the time step and momentum terms, and update parameters using corrected weighted sums of derivatives and squared derivatives.

CNN Section Overview2:05
Introduce convolutional neural networks (CNNs) and their real-world applications, from image classification and style transfer to object detection in bloodstream images; cover padding, strides, transfer learning, and visualization.
Image Data1:39
Explore convolutional neural networks with a focus on image data. Understand how images are represented as pixel matrices with channels for grayscale, red, green, blue, and the alpha transparency channel.
Filters5:27
Explore how filters power convolutional neural networks by detecting edges and features through a convolutional operator, producing activation maps that reveal image differences across channels.
Padding1:01
Use zero padding to keep the activation the same size as the input, padding on each side according to the kernel size to shape the activation map.
Strides2:31
Explore strides in convolution, applying filters across an image with stride one or more, using dot products, padding options, and output size calculations.
Reshaping7:13
Learn how reshaping converts vectors and tensors into matrices for convolutional networks, mapping 784-element vectors to 28×28 images and handling three-dimensional, four-dimensional, and five-dimensional shapes while preserving order.
Introducton to Convolutional Neural Networks5:15
Learn how convolutional neural networks apply convolutional layers to 28x28 grayscale inputs, learn filters during training, and flatten activations for classification; explore stride and peeling layers to reduce parameters.
Convolutional Neural Networks Forward Propagation13:57
Describe forward propagation in a cnn: apply 3x3 filters with padding and stride, reshape patches into a strips matrix, multiply by filter weights, add bias, apply activation, and softmax.
CNN Forward Propagation Lab Part 1 - Parameter Initialization8:46
Implement CNN forward propagation in a classification model by initializing convolutional filters and biases, handling input dimensions, padding, strides, and transitioning to fully connected layers with correct sizing.
CNN Forward Propagation Lab Part 2 - Forward Propagation Method12:52
Initialize h0 activation for a four-dimensional batch of images, then perform padding, extract patches, reshape to strips, and compute activations through convolutional layers before connecting to dense layers.
CNN Forward Propagation Lab Part 3 - Extract Patches and Test8:17
Extract patches from images, reshape to four-dimensional tensors, and test forward propagation in a CNN with patch-based processing using Python and TensorFlow 2.
Convolutional Neural Networks Backpropagation5:47
Apply backpropagation in convolutional neural networks by deriving gradients through activation maps, patches, and padded inputs using 3x3 windows and strides.
Convolutional Neural Networks Backpropagation Lab22:35
Implement backpropagation for a convolutional neural network in a lab, adapting derivatives across convolutional and fully connected layers. Reshape activation maps, handle padding and patch stitching, and validate training accuracy.
Pooling Layers9:23
Explore pooling layers that reduce spatial dimensions in convolutional neural networks using two-by-two windows. Learn how average pooling and max pooling affect translational invariance and forward and backward propagation.
Pooling Lab Part 1 Forward Propagation (optional)10:29
Learn to implement a pooling layer in a cnn through forward propagation, adding a pooling window parameter, performing max pooling, and tracking max indices for backpropagation in this optional lab.
Pooling Lab Part 2 - Backpropagation (optional)10:51
Learn backpropagation through a convolutional neural network with max pooling from scratch, computing activation derivatives and patch stitching to backpropagate through the pooling layer.
Introduction to Tensorflow Keras Part 117:50
Explore using TensorFlow and Keras to build and train neural networks with high-level APIs. Learn data loading, normalization, and model definitions—sequential, functional, and subclassing—plus training with fit and optimizers.
Introduction to Tensorflow Keras Part 217:12
Train and evaluate a basic TensorFlow Keras model, use predict outputs, and implement sparse and categorical cross entropy while building convolutional networks with checkpoints.
Creating a Custom Image Dataset - Part 1 Data Preparation21:41
Pre-process an image dataset for classification by standardizing to 256 by 256 region via center extraction, organize class folders, and plan batch loading with tf.data for memory efficiency in Colab.
Creating a Custom Image Dataset - Part 2 Creating a Tensorflow Record19:17
Create a Tensorflow record from a large image dataset by cropping images, encoding them as bytes, and storing image and class features, then split into train, validation, and test.
Using Tensorflow Records for Training18:18
Upload and load tfrecord datasets in Colab, map records to images and labels, train a CNN on a larger dataset with GPU, and assess overfitting.
A Brief History of CNNs for Image Classifications9:24
Trace the history of CNNs for image classification from two-stage methods to end-to-end models like AlexNet, Inception, ResNet, and EfficientNet, highlighting data augmentation and transfer learning.
AlexNet Implementation Part 1 Data Preparation17:15
Explore AlexNet data preparation for training a CNN, including zero-centering with batch-wise mean values and data augmentation with random crops, flips, and brightness adjustments.
AlexNet Implementation part 2 Model Definition13:52
Define an AlexNet-style CNN in Keras using the functional API with conv layers, max pooling, dropout, and dense softmax classifiers; train with data augmentation and discuss transfer learning.
Transfer Learning20:28
Explore transfer learning with a pre trained image model, replace the final layer for 256 classes, freeze and fine tune layers using global average pooling, dropout, and softmax.
Occlusion Sensitivity17:01
Explore occlusion sensitivity to visualize convolutional neural networks by sliding a window across images and creating heat maps that reveal where the model detects the snail class.
Style Transfer6:25
Explore neural style transfer that blends a content image with a style image by optimizing content and style losses via CNN activations and Gram matrices.
Style Transfer Lab Part 1 - Setup8:21
Set up a neural style transfer lab by loading content and style images, using a frozen VGG19 model, and extracting multi-layer activation maps for content and style losses.
Style Transfer Lab Part 2 - Gram Matrix and Losses17:42
Compute gram matrices from activation maps to quantify style, extract content and style activations from content and style images, and optimize a generated image using combined content and style losses.
Style Transfer Lab Part 3 - Training and Results9:38
Learn to perform style transfer with a gradient tape training loop, activation maps, and clipping, adjusting style weight and momentum for evolving stylized images.
One Shot Learning Overview11:18
Explore one-shot learning with a metric-based approach that trains a distance function to distinguish face categories using embeddings, triplet loss, and a 128-d vector, enabling face verification and recognition.
Face Verification and Recognition Lab19:52
Learn face verification and recognition with a pre-trained model and a facial recognition library. Compute 128-d embeddings to perform one-to-one verification and one-to-many recognition against a labeled database.
Object Detection Architecture and Label Format14:35
Object detection with neural networks locates multiple objects in images using bounding boxes and a yolo-style grid that predicts centers, box coordinates, and class probabilities.
Object Detection Loss Function.mp46:22
Learn how object detection loss combines binary cross entropy for object presence with regression losses for center coordinates and size, using sqrt-transformations and multi-label class losses across anchors.
Object Detection Lab Part 1 - Setup16:20
Train an object detection model to locate blood cells in images by parsing XML annotations, scaling bounding boxes, and using a pre-trained network.
Object Detection Lab Part 2 - Label Creation Loss Function and Training17:54
Select three anchor boxes via k-means elbow analysis for a 19x19 grid. Create IoU-based labels, apply a custom loss, train, and save the model.
Object Detection Making Predictions and Evaluating13:58
Convert 7x7x16 prediction matrices to bounding boxes with anchor boxes, then evaluate using IoU, non-maximum suppression, and F1 score.
Object Detection Lab Part 3 - Extracting Predictions14:34
Extract predictions from the 3-D matrix to compute bounding boxes on images, scale from 600x600 input to 416x416, apply non-maximum suppression, and assign class labels with a 0.25 threshold.
Object Detection Lab Part 4 - Non-max Suppression10:38
Apply non-max suppression to filter overlapping bounding boxes by class and iou threshold, keeping the most confident predictions.
Object Detection Lab Part 5 - F1 Score14:18
Learn to evaluate object detection with IOU-based metrics, precision, recall, true positives, false positives and false negatives, and compute classwise matrices and overall F1 scores across validation data.
CNN Section Summary1:11
Master convolutional neural networks through labs on data augmentation, custom loss functions, and model visualization, then apply transfer learning, early stopping, and a precision-recall metric for evaluation.

Sequential Data Overview2:37
Explore sequential data and order-dependent architectures, build a from-scratch recurrent neural network for sentiment classification, and read and generate text from Romeo and Juliet using attention in GBG three.
Recurrent Neural Networks15:59
Explore recurrent neural networks, weight sharing across time steps, and hidden states for sequential tasks like sentiment analysis and language translation using encoder-decoder architectures.
Forward Propagation for RNNs6:07
Learn forward propagation for RNNs in sentiment classification, using time-step processing, shared weight matrices, biases, one-hot encoding, activation functions, and batching.
Data Prep and Forward Propagation Lab25:37
Format text data for sentiment classification by building a vocabulary, padding sentences, and encoding labels; then implement a forward propagation neural network in Python with TensorFlow 2.
Backpropagation for RNNs5:28
Learn to implement backpropagation for RNNs by defining binary cross-entropy loss, deriving time-step weight updates for the weight matrices, and applying shared derivatives across all timesteps.
Backpropagation and vanishing Gradient Lab24:26
Implement backpropagation in a neural network to predict sentences, explore vanishing gradient with longer sequences, and discuss mitigation using long short term memory.
LSTM Theory11:30
Explore how long short-term memory networks address vanishing gradients by using a memory cell with forget, input, and output gates to preserve information across longer sequences.
RNNs, LSTMS and GRUs in Tensorflow Lab14:27
Implement rnn, lstm, and gru models in TensorFlow to classify sentiment, loading basic and longer sentences, training with embeddings, and compare architectures and training strategies.
Character Based Text Generation31:42
Master character-based text generation by training a small lstm model on Romeo and Juliet text in Google Colab, converting text to indices, one-hot encoding, and sampling new sequences.
Word Embeddings6:36
Explore word embeddings that replace sparse one-hot vectors with dense representations, enabling similar words to share meaning through context in sentences, while noting they are not always interpretable.
Exploring GloVe Word Embeddings Lab16:45
Explore GloVe word embeddings by loading a 300-dimensional model trained on 840 billion tokens, mapping words to vectors, and using cosine similarity for semantic analogies.
Advanced Sentiment Classification with GloVe19:26
Explore advanced sentiment classification using GloVe embeddings on airline reviews, map words to embeddings, create train/validation/test splits, and train a neural network achieving about 76% test accuracy.
Advanced Sentiment Classification with BERT20:10
Apply transformer-based BERT embeddings to sentiment classification, using a preprocessing and embedding layer, fine-tuning with Adam and sigmoid output to beat glove results with ~83% test accuracy.
Attention Models Theory8:40
Learn how the attention mechanism enhances encoder-decoder models by using an attention layer to compute a context vector from input hidden states for tasks like machine translation and text summarization.
Attention Models Lab Part 1 Model Definition and Training34:48
Define, train, and visualize an attention model that maps diverse date formats to a standard date, using data loading, one-hot encoding, and a custom label matches metric.
Attention Models Lab Part 2 Visualizing Attention7:51
Run the model in eager mode to extract attention values for a selected index, then plot a grayscale attention heatmap showing input characters the model focuses on when generating output.
Sequential Data Summary0:58
Explore sequential data techniques for sentiment classification and text generation, and apply them to audio, sales forecasting, and image captioning to produce descriptive text.

Requirements

Secondary Level (High School) Mathematics
Some basic programming experience in Python

Description

Gain a deep understanding of Supervised Learning techniques by studying the fundamentals and implementing them in NumPy.

Gain hands-on experience using popular Deep Learning frameworks such as Tensorflow 2 and Keras.

Section 1 - The Basics:

- Learn what Supervised Learning is, in the context of AI

- Learn the difference between Parametric and non-Parametric models

- Learn the fundamentals: Weights and biases, threshold functions and learning rates

- An introduction to the Vectorization technique to help speed up our self implemented code

- Learn to process real data: Feature Scaling, Splitting Data, One-hot Encoding and Handling missing data

- Classification vs Regression

Section 2 - Feedforward Networks:

- Learn about the Gradient Descent optimization algorithm.

- Implement the Logistic Regression model using NumPy

- Implement a Feedforward Network using NumPy

- Learn the difference between Multi-task and Multi-class Classification

- Understand the Vanishing Gradient Problem

- Overfitting

- Batching and various Optimizers (Momentum, RMSprop, Adam)

Section 3 - Convolutional Neural Networks:

- Fundamentals such as filters, padding, strides and reshaping

- Implement a Convolutional Neural Network using NumPy

- Introduction to Tensorfow 2 and Keras

- Data Augmentation to reduce overfitting

- Understand and implement Transfer Learning to require less data

- Analyse Object Classification models using Occlusion Sensitivity

- Generate Art using Style Transfer

- One-Shot Learning for Face Verification and Face Recognition

- Perform Object Detection for Blood Stream images

Section 4 - Sequential Data

- Understand Sequential Data and when data should be modeled as Sequential Data

- Implement a Recurrent Neural Network using NumPy

- Implement LSTM and GRUs in Tensorflow 2/Keras

- Sentiment Classification from the basics to the more advanced techniques

- Understand Word Embeddings

- Generate text similar to Romeo and Juliet

- Implement an Attention Model using Tensorflow 2/Keras

Who this course is for:

Beginner Python programmers curious about Artificial Intelligence
People looking for an AI course that teaches both the theoretical and practical aspects of Artificial Intelligence

Supervised Learning for AI with Python and Tensorflow 2

What you'll learn

Explore related topics

Course content

Introduction4 lectures • 23min

The Basics18 lectures • 3hr 26min

Feedforward Neural Networks32 lectures • 5hr 3min

Convolutional Neural Networks41 lectures • 8hr 4min

Sequential Data17 lectures • 4hr 13min

Conclusion1 lecture • 5min

Requirements

Description

Who this course is for: