What you'll learn

To understand deep learning and reinforcement learning paradigms
To understand Architectures and optimization methods for deep neural network training
To implement deep learning methods within Tensor Flow and apply them to data
To understand the theoretical foundations and algorithms of reinforcement learning
To apply reinforcement learning algorithms to environments with complex dynamics

Course content

15 sections • 128 lectures • 18h 58m total length

Introduction to Deep Reinforcement Learning8:28
This lecture introduces reinforcement learning, covering fundamental concepts and their connection to deep learning and neural networks. The course is structured into two main parts: foundational deep learning principles, including numerical methods and coding exercises, followed by reinforcement learning, exploring different agent types and mechanisms.
Initially, deep learning is introduced to understand the neural networks that reinforcement learning (RL) agents use. Following this, the focus shifts to RL, covering agents, environments, rewards, and punishments. The course aims to make students proficient in understanding RL agents, their neural network models, and how these elements integrate to form effective learning systems.
Key topics include defining RL mechanics and components and comparing RL with supervised and unsupervised learning. A historical overview is given, from RL's origins in robotics to its applications in fields like ChatGPT's model development, where RL plays a crucial role. Practical applications and examples demonstrate how RL agents operate in dynamic scenarios, learning through trial and error to maximize cumulative rewards. The distinction between RL’s dynamic learning versus the static nature of traditional machine learning is highlighted, emphasizing RL's adaptability and real-time learning.
The lecture explains RL's core mechanisms: trial and error, delayed rewards, and sequential decision-making. These are linked to human learning patterns, where agents maximize rewards through situational actions. Comparisons with static machine learning illustrate RL's continuous adaptation in various scenarios, reinforcing its role as an evolving subset within the broader AI domain alongside machine and deep learning.
Reinforcement Learning and its main components (agent, environment, rewards)20:25
Comparison with supervised and unsupervised learning18:21
Overview of the RL history4:06
Recent advances in Deep Reinforcement Learning5:07
Learning objectives for the course and Introduction to Python11:06
Experts' review on Introduction of the course10:39

Review of Reinforcement Learning22:49
Introduction to Value Function Approximation1:26
Python Code: Value Function Approximation using CartPole12:37
Linear function approximation0:51
Python Code: Linear Function Approximation using CartPole1:44
Non-linear function approximation with deep neural networks0:37
Python Code: Non-Linear Function Approximation with Neural Networks3:28
Applications and limitations of Value Function Approximation0:50
Definition of Markov Decision Processes (MDPs)0:59
Python Code: MDPs and Bellman Equations and Value Functions9:15
Key components of an MDP3:45
Bellman Equations and Value Functions0:45
Policy iteration and value iteration algorithms18:34
Python Code: Policy iteration and value iteration algorithms10:41
Experts' review on Markov Decision Processes and Applications17:44

Python Code: Introduction to Python Gym Library Documentation8:01
Review of Bellman Equations4:14
Definition of value functions (state value, action value)2:20
Calculation of value functions using Bellman Equations1:58
Intuitive interpretation of value functions3:00
Markov Processes12:01
Markov Reward Processes14:11
Markov Decision Processes17:39
Extensions to MDPs6:56
Experts' review on Bellman Equations and Value Functions18:14

Definition of Q-Learning1:11
Calculation of Q-Values using Q-Learning3:43
Python Code: Q-Learning and Python Gym library7:06
Comparison of Q-Learning with policy iteration and value iteration algorithms0:49
Advantages and disadvantages of Q-Learning2:13
Overview of Deep Q-Network (DQN) algorithm5:01
Architecture of a DQN model1:29
Implementation of DQN in TensorFlow3:24
Python Code: Implementation of DQN20:36
Applications and limitations of DQN3:22
Experts' review on Q-Learning in Frozen Lake25:29

Requirements

Basic python programming but not necessary

Description

This course is the integration of deep learning and reinforcement learning. The course will introduce student with deep neural networks (DNN) starting from simple neural networks (NN) to recurrent neural network and long-term short-term memory networks. NN and DNN are the part of reinforcement learning (RL) agent so the students will be explained how to design custom RL environments and use them with RL agents. After the completion of the course the students will be able:

To understand deep learning and reinforcement learning paradigms
To understand Architectures and optimization methods for deep neural network training
To implement deep learning methods within Tensor Flow and apply them to data.
To understand the theoretical foundations and algorithms of reinforcement learning.
To apply reinforcement learning algorithms to environments with complex dynamics.

Course Contents:

Introduction to Deep Reinforcement Learning
Artificial Neural Network (ANN)
ANN to Deep Neural Network (DNN)
Deep Learning Hyperparameters: Regularization
Deep Learning Hyperparameters: Activation Functions and Optimizations
Convolutional Neural Network (CNN)
CNN Architecture
Recurrent Neural Network (RNN)
RNN for Long Sequences
LSTM Network
Overview of Markov Decision Processes
Bellman Equations and Value Functions
Deep Reinforcement Learning with Q-Learning
Model-Free Prediction
Deep Reinforcement Learning with Policy Gradients
Exploration and Exploitation in Reinforcement Learning

Who this course is for:

Data Scientists
Machine Learning Engineers
Robotics Programmer

What you'll learn

Explore related topics

Course content

Introduction7 lectures • 1hr 18min

Artificial Neural Network (ANN)4 lectures • 47min

ANN to Deep Neural Network (DNN)8 lectures • 2hr

Deep Learning Hyperparameters Regularization9 lectures • 55min

Deep Learning Hyper parameters, Activation Functions and Optimizations8 lectures • 1hr 5min

Convolutional Neural Network (CNN)4 lectures • 42min

Recurrent Neural Network (RNN)6 lectures • 1hr 35min

Reinforcement Learning: Overview of Markov Decision Processes15 lectures • 1hr 46min

Bellman Equations and Value Functions10 lectures • 1hr 29min

Deep Reinforcement Learning with Q-Learning11 lectures • 1hr 14min

Requirements

Description

Who this course is for: