Deep Learning and Generative Artificial Intelligence

Name: Deep Learning and Generative Artificial Intelligence
Rating: 4.5 (34 reviews)

CNNs, LSTMs, GANs, VAEs, Transformers (including GPTs) and Stable Diffusion

Created by Luís Cunha, PhD

Last updated 10/2024

English

What you'll learn

Learn the basic principles of artificial neural networks and how they are trained.
Implement and train Convolutional Neural Networks (CNNs) for image classification and object detection using Python.
Design and apply Long Short-Term Memory (LSTM) networks to predict and analyze time series data.
Construct, fine-tune, and deploy Transformer models, such as GPT-type models, for various natural language processing tasks.
Create and train Generative Adversarial Networks (GANs) to generate realistic synthetic images and data.
Build and utilize Variational Auto-Encoders (VAEs) for data compression and generation tasks.
Apply style transfer and stable diffusion methods to creatively alter and enhance images.

Course content

14 sections • 182 lectures • 4h 43m total length

Welcome to the Course 0:57
Foundations 01 0:46
Introduction to the Course
Foundations 02 0:29
Deep Learning in the context of AI and Machine Learning
Foundations 03 0:21
Deep Learning: Getting Rules from Data + Answers
Foundations 04 0:19
The human brain: an inspiration for many of today's AI "godfathers"
Foundations 05 0:11
Foundations 06 0:19
Biological Neuron Action Potential (Signal Propagation in "Real" Neurons)
Foundations 07 0:21
Activation Function of an Artificial Neuron
Foundations 08 0:17
Comparison of a Biological Neural Network and a Simple Artificial Neural Network
Foundations 09 0:35
Foundations 10 0:44
Foundations 11 0:32
Foundations 12 0:45
Foundations 13 0:35
We compare the human brain's 16 billion neurons with about 7000 connections each, totaling 112 trillion connections, to simple artificial neural networks, illustrating the vast complexity gap.
Foundations 14 0:45
Analyze how GPT-3 with 175 billion parameters and trillion-parameter models like Pangas illustrate exponential growth in AI complexity, suggesting future AI could rival human cognitive capacity.
Foundations 15 1:07
Foundations 16 0:35
Foundations 17 0:34
Foundations 18 0:45
Foundations 19 1:10
Foundations 20 0:30
Foundations 21 0:49
Follow backpropagation in a neural network: pass from input through a hidden layer to output, evaluate the prediction error, and apply gradients to adjust the weights.
Foundations 22 1:18
Foundations 23 0:30
Foundations 24 1:10
Foundations 25 0:57
Foundations 26 1:22
Foundations 27 0:40
Foundations 28 0:27
Foundations 29 1:00
Foundations 30 0:40
Foundations 31 0:52
Foundations 32 2:00
Foundations 33 0:43
Foundations 34 0:48
Calculate accuracy as (true positives + true negatives) / total, demonstrated with true positives 161 and true negatives 129 out of 320, illustrating overall model performance.
Foundations 35 1:15
Foundations 36 1:05
Foundations 37 0:44
Foundations 38 0:34
Foundations 39 1:16
Explore how recall, or sensitivity, gauges a model's ability to identify positive cases. Pair it with specificity to reduce false positives in fraud detection and disease diagnostics.
Foundations 40 0:56
Foundations 41 0:52
Foundations 42 0:10
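
The accuracy, recall, and specificity lectures listed above lend themselves to a quick check in Python. The sketch below is only an illustration: the counts of 161 true positives and 129 true negatives out of 320 come from the lecture description, while the split of the remaining 30 errors into 12 false positives and 18 false negatives, and the function names, are assumptions made for this example.

```python
def accuracy(tp, tn, fp, fn):
    """Share of all predictions that are correct."""
    return (tp + tn) / (tp + tn + fp + fn)


def recall(tp, fn):
    """Sensitivity: share of actual positives that the model finds."""
    return tp / (tp + fn)


def specificity(tn, fp):
    """Share of actual negatives that the model correctly rejects."""
    return tn / (tn + fp)


# Counts from the lecture description.
tp, tn = 161, 129
# ASSUMPTION: the description does not say how the other 30 cases split.
fp, fn = 12, 18

print("accuracy:   ", accuracy(tp, tn, fp, fn))  # (161 + 129) / 320 = 0.90625
print("recall:     ", round(recall(tp, fn), 4))
print("specificity:", round(specificity(tn, fp), 4))
```

Accuracy depends only on the total number of errors, so it stays at 0.90625 however the 30 errors are split; recall and specificity change with the split, which is why they are reported alongside accuracy.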

AI for Vision - Part 01 0:46
AI for Vision - Part 02 1:44
AI for Vision - Part 03 0:29
AI for Vision - Part 04 0:17
AI for Vision - Part 05 0:36
AI for Vision - Part 06 0:26
See how a convolutional neural network is structured and implemented. Gain insight into CNN concepts through a first general view and a typical code implementation.
AI for Vision - Part 07 0:25
AI for Vision - Part 08 1:39
Explore how neural networks process images by using ReLU in hidden layers and softmax in the output layer for multi-class classification, with 28x28 grayscale inputs and RGB color encoding.
AI for Vision - Part 09 1:06
AI for Vision - Part 10 1:01
AI for Vision - Part 11 0:59
Explore how a convolutional neural network processes input data: images as pixel grids, with grayscale values from 0 to 255 and red, green, and blue color channels, for filtering and image recognition.
AI for Vision - Part 12 0:30
AI for Vision - Part 13 0:37
AI for Vision - Part 14 0:31
AI for Vision - Part 15 0:53
AI for Vision - Part 16 0:24
Slide a 3x3 filter over the input image, perform element-wise multiplication with each patch, and sum the results to form a feature map; the filter values are learned during CNN training.
AI for Vision - Part 17 0:18
AI for Vision - Part 18 0:11
AI for Vision - Part 19 0:28
AI for Vision - Part 20 0:47
AI for Vision - Part 21 0:49
AI for Vision - Part 22 0:26
CNN architectures apply to many tasks by combining convolution, ReLU, and pooling layers to learn complex features for classification, object detection, segmentation, and probabilistic control.
AI for Vision - Part 23 0:22
AI for Vision - Part 24 0:27
AI for Vision - Part 25 0:30
AI for Vision - Part 26 0:48
AI for Vision - Part 27 0:58
AI for Vision - Part 28 0:33
AI for Vision - Part 29 0:39
AI for Vision - Part 30 0:41
Engage with interactive playgrounds to test and visualize CNN predictions, deepening your understanding of CNN-based deep learning concepts through hands-on code demos and GitHub-backed experimentation.
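
The filter-sliding step described in the lectures above can be sketched in plain Python. This is a minimal illustration, not the course's own code: it assumes a stride of 1 and no padding, the 5x5 image is made up, and the 3x3 filter is a hand-picked vertical-edge detector, whereas in a trained CNN the filter values would be learned.

```python
def conv2d(image, kernel):
    """Slide the kernel over the image (stride 1, no padding) and return the feature map."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    feature_map = []
    for r in range(out_h):
        row = []
        for c in range(out_w):
            # Element-wise multiply the patch with the kernel, then sum.
            total = 0.0
            for i in range(kh):
                for j in range(kw):
                    total += image[r + i][c + j] * kernel[i][j]
            row.append(total)
        feature_map.append(row)
    return feature_map


def relu(feature_map):
    """Replace negative responses with zero, as a ReLU layer does."""
    return [[max(0.0, v) for v in row] for row in feature_map]


# ASSUMPTION: toy grayscale image (values 0-255) with a dark-to-bright vertical edge.
image = [[0, 0, 255, 255, 255] for _ in range(5)]

# ASSUMPTION: hand-picked filter; a CNN would learn these nine values during training.
kernel = [
    [-1, 0, 1],
    [-1, 0, 1],
    [-1, 0, 1],
]

for row in relu(conv2d(image, kernel)):
    print(row)
```

The feature map is 3x3 because a 3x3 filter fits in three positions along each side of a 5x5 image. The response is large where the patch straddles the edge and zero where the patch is uniformly bright.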

Deep Learning for Time Series 01 0:53
Deep Learning for Time Series 02 0:43
Deep Learning for Time Series 03 0:49
Explore recurrent neural networks and how they recognize patterns in sequences, enabling time series forecasting by handling variable-length data and sharing parameters across the sequence.
Deep Learning for Time Series 04 0:22
Deep Learning for Time Series 05 0:23
Deep Learning for Time Series 06 0:24
Identify the vanishing gradients problem, where gradients shrink through repeated multiplication of weights less than one, causing updates to vanish and the training process to stall.
Deep Learning for Time Series 07 0:44
Deep Learning for Time Series 08 0:24
Deep Learning for Time Series 09 0:28
Deep Learning for Time Series 10 0:25
RNNs capture differences in sequence order to understand context in time series. They distinguish subtle meaning changes with the same words, a key strength for modeling sequential data.
Deep Learning for Time Series 11 0:52
Explore how LSTM networks overcome standard RNN limits with forget, store, update, and output gates that control the cell state and sustain gradient flow for long-term dependencies.
Deep Learning for Time Series 12 0:40
Deep Learning for Time Series 13 0:39
Utilize an LSTM-based time-series model to predict the S&P 500 index price using historical data since 1990, including volatility, rates, unemployment, sentiment, and new indicators beyond the original paper.
Deep Learning for Time Series 14 0:27
Deep Learning for Time Series 15 0:47
Deep Learning for Time Series 16 0:38
Deep Learning for Time Series 17 0:21
Deep Learning for Time Series 18 0:42
Deep Learning for Time Series 19 0:19
Explore how indicators behave over time in time series data by visualizing historical trends and their relation to closing prices, building an intuitive understanding before modeling.
Deep Learning for Time Series 20 0:21
Deep Learning for Time Series 21 1:25
Deep Learning for Time Series 22 0:19
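
The gate mechanism described in the LSTM lecture above can be sketched as a single time step with scalar states. This is a simplified illustration under stated assumptions, not the course's implementation: real LSTM layers use vectors and weight matrices, the function and parameter names are hypothetical, and the weights are hand-picked rather than trained. The "store" and "update" steps in the description are read here as the input gate and the candidate value.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def lstm_step(x, h_prev, c_prev, params):
    """One scalar LSTM step. params maps a gate name to (w_x, w_h, bias)."""
    def pre_activation(name):
        w_x, w_h, bias = params[name]
        return w_x * x + w_h * h_prev + bias

    forget = sigmoid(pre_activation("forget"))          # how much old cell state to keep
    store = sigmoid(pre_activation("input"))            # how much new content to write
    candidate = math.tanh(pre_activation("candidate"))  # the new content itself
    output = sigmoid(pre_activation("output"))          # how much cell state to expose

    c = forget * c_prev + store * candidate  # additive update of the cell state
    h = output * math.tanh(c)
    return h, c


# ASSUMPTION: hand-picked weights. A large forget bias keeps the forget gate
# near 1, and a very negative input bias keeps the store gate near 0.
params = {
    "forget": (0.0, 0.0, 10.0),
    "input": (0.0, 0.0, -10.0),
    "candidate": (1.0, 1.0, 0.0),
    "output": (0.0, 0.0, 0.0),
}

h, c = 0.0, 1.0
for x in [0.5] * 50:
    h, c = lstm_step(x, h, c, params)

print("cell state after 50 steps:", round(c, 4))
print("plain 0.9 factor repeated 50 times:", round(0.9 ** 50, 4))
```

With the forget gate close to 1, the cell state carries its value across 50 steps almost unchanged, whereas a quantity multiplied by 0.9 at every step shrinks to well under 1% of its starting value. That contrast is the intuition behind the LSTM's sustained gradient flow.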

The Transformer Model in Language Processing 01 0:22
The Transformer Model in Language Processing 02 0:22
Explore transformers as the leading model for language processing since 2017, powering ChatGPT, and extending to image and time sequence processing, while noting emerging successor architectures.
The Transformer Model in Language Processing 03 0:29
The Transformer Model in Language Processing 04 0:37
Understand how the transformer model uses an encoder-decoder architecture, with multi-head attention, feed-forward layers, and positional encodings, to compute output probabilities via softmax for efficient language processing.
The Transformer Model in Language Processing 05 0:34
The Transformer Model in Language Processing 06 0:30
The Transformer Model in Language Processing 07 0:34
Explore how word embeddings map semantic relationships in a vector space, showing how similar contexts link words like man and woman, guiding translation and text generation.
The Transformer Model in Language Processing 08 0:25
The Transformer Model in Language Processing 09 0:41
Explain how positional encoding in the transformer adds position information to input embeddings, enabling parallel sequence processing and unique sine and cosine encodings for each token position.
The Transformer Model in Language Processing 10 0:41
The Transformer Model in Language Processing 11 0:20
The Transformer Model in Language Processing 12 0:20
Explore the transformer architecture with a focus on multi-head attention, enabling the model to attend to different input parts simultaneously and boost translation and summarization performance.
The Transformer Model in Language Processing 13 0:26
See how a single attention head in a transformer links the word dog to other words, using query, keys, and values to weight dependencies and focus on sentence parts.
The Transformer Model in Language Processing 14 0:23
The Transformer Model in Language Processing 15 0:21
The Transformer Model in Language Processing 16 0:23
The Transformer Model in Language Processing 17 0:19
The transformer generates key vectors from word embeddings using a key weight matrix, enabling comparison with query vectors to compute attention scores.
The Transformer Model in Language Processing 18 0:18
The Transformer Model in Language Processing 19 0:21
The Transformer Model in Language Processing 20 0:23
The Transformer Model in Language Processing 21 0:23
The Transformer Model in Language Processing 22 0:22
The Transformer Model in Language Processing 23 0:26
The Transformer Model in Language Processing 24 0:20
Process input embeddings with positional encodings through self-attention and feed-forward networks in a transformer encoder to capture complex patterns and pass the final result for further processing.
The Transformer Model in Language Processing 25 0:21
The Transformer Model in Language Processing 26 0:21
Explore the transformer encoder architecture, including add and normalize layers, residual connections, and the combined power of self-attention and feedforward networks to capture language dependencies.
The Transformer Model in Language Processing 27 0:35
The Transformer Model in Language Processing 28 0:33
The Transformer Model in Language Processing 29 0:14
Explore transformer models in language processing through code demos and transformers beyond code, with playgrounds from various companies and platforms.
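
The single-head attention lectures above (query, keys, values, and the key weight matrix) can be sketched in a few lines of plain Python. This is an illustrative sketch under assumptions: the two-dimensional embeddings and the weight matrices are made-up toy values, the function names are hypothetical, and the scores are divided by the square root of the key dimension, as in the standard scaled dot-product formulation.

```python
import math


def project(vector, matrix):
    """Multiply a row vector by a weight matrix given as a list of rows."""
    return [sum(vector[i] * matrix[i][j] for i in range(len(vector)))
            for j in range(len(matrix[0]))]


def softmax(scores):
    top = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention(query, keys, values):
    """Scaled dot-product attention for one query over all keys and values."""
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    weights = softmax(scores)
    output = [sum(w * value[d] for w, value in zip(weights, values))
              for d in range(len(values[0]))]
    return weights, output


# ASSUMPTION: toy 2-dimensional embeddings for a three-word sentence.
words = ["the", "dog", "barks"]
embeddings = [[0.1, 0.0], [1.0, 0.5], [0.8, 0.9]]

# ASSUMPTION: hand-picked weight matrices; a trained model learns these.
w_query = [[1.0, 0.0], [0.0, 1.0]]
w_key = [[1.0, 0.0], [0.0, 1.0]]
w_value = [[0.5, 0.0], [0.0, 0.5]]

queries = [project(e, w_query) for e in embeddings]
keys = [project(e, w_key) for e in embeddings]
values = [project(e, w_value) for e in embeddings]

# Attention weights for the word "dog" over every word in the sentence.
weights, output = attention(queries[1], keys, values)
for word, weight in zip(words, weights):
    print(word, round(weight, 3))
```

The weights always sum to 1, so the output for "dog" is a weighted average of the value vectors. A multi-head layer runs several such heads in parallel, each with its own weight matrices.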

Requirements

Basic understanding of programming concepts is recommended, but not required.
Familiarity with Python will be helpful for coding exercises.
Access to a computer with an internet connection for using demos and playgrounds.

Description

Welcome to the Deep Learning and Generative Artificial Intelligence course! This comprehensive course is designed for anyone interested in diving into the exciting world of deep learning and generative AI, whether you're a beginner with no programming experience or an experienced developer looking to expand your skill set.

What You Will Learn:

Foundations of Deep Learning and Artificial Neural Networks: Gain a solid understanding of the basic concepts and architectures that form the backbone of modern AI.
Convolutional Neural Networks (CNNs): Learn how to implement and train CNNs for image classification and object detection tasks using Python and popular deep learning libraries.
Long Short-Term Memory (LSTM) Networks: Explore the application of LSTM networks to predict and analyze time series data, enhancing your ability to handle sequential data.
Transformer Models: Dive into the world of Transformer models, including GPT-type models, and learn how to construct, fine-tune, and deploy these models for various natural language processing tasks.
Generative Adversarial Networks (GANs): Understand the principles behind GANs and learn how to create and train them to generate realistic synthetic images and data.
Variational Auto-Encoders (VAEs): Discover how to build and utilize VAEs for data compression and generation, understanding their applications and advantages.
Style Transfer and Stable Diffusion: Experiment with style transfer techniques and stable diffusion methods to creatively alter and enhance images.

Course Features:

Interactive Coding Exercises: Engage with hands-on coding exercises designed to reinforce learning and build practical skills.
User-Friendly Demos and Playgrounds: For those who prefer a more visual and interactive approach, our course includes demos and playgrounds to experiment with AI models without needing to write code.
Real-World Examples: Each module includes real-world examples and case studies to illustrate how these techniques are applied in various industries.
Project-Based Learning: Apply what you've learned by working on projects that mimic real-world scenarios, allowing you to build a portfolio of AI projects.

Who Should Take This Course?

Aspiring AI Enthusiasts: Individuals with no prior programming experience who want to understand and leverage AI through intuitive interfaces.
Developers and Data Scientists: Professionals looking to deepen their understanding of deep learning and generative AI techniques.
Students and Researchers: Learners who want to explore the cutting-edge advancements in AI and apply them to their studies or research projects.

Who this course is for:

This course is designed for anyone interested in deep learning and generative AI, including beginners with no programming experience who want to use AI through user-friendly interfaces, as well as programmers looking to deepen their understanding and skills in this field.

Course content

Foundations of Modern AI • 43 lectures • 33min

Playground for the Foundational Part of the Course • 1 lecture • 43min

Code demos for the Foundational Part of the Course • 3 lectures • 20min

Artificial Intelligence for Visual Tasks • 30 lectures • 20min

Playgrounds for AI for Vision • 2 lectures • 22min

Code demos of AI for Computer Vision • 3 lectures • 17min

Deep Learning for Time Series • 22 lectures • 13min

Code Demo for Part 2 - Time Series • 1 lecture • 3min

Deep Learning for Language - The Transformer Model • 29 lectures • 12min

Code Demos - Language and AI • 1 lecture • 6min
