Teach on Udemy

Turn what you know into an opportunity and reach millions around the world.

Learn More

Your cart is empty.

Keep shopping

Reinforcement Learning with Pytorch

Name: Reinforcement Learning with Pytorch
Rating: 4.3 (402 reviews)

Learn to apply Reinforcement Learning and Artificial Intelligence algorithms using Python, Pytorch and OpenAI Gym

Created byAtamai AI Team

Last updated 8/2020

English

What you'll learn

Reinforcement Learning basics
Tabular methods
Bellman equation
Q Learning
Deep Reinforcement Learning
Learning from video input

Course content

8 sections • 69 lectures • 7h 14m total length

Welcome!2:22
Before you start - Videos quality!0:45
Resources1:07

Introduction #14:28
Introduction #25:09
Introduction #34:33
Introduction #46:00
Environment setup / Installation1:14
Lab. OpenAI Gym #13:06
Lab. OpenAI Gym #210:40
Here to clarify the doubts and to have clear understanding about terms:

episode -  episode is single instance of a game - so in case of CartPole it is since when single game starts up to the moment when this game finishes... so according to explanation from OpenAI website it is :
"The episode ends when the pole is more than 15 degrees from vertical, or the cart moves more than 2.4 units from the center."

So technically until done flag is set to True.

  For example we do our training for 500 episodes... so we play game 500 times...

step - it is single move of our agent - so in case of CartPole it is single move left or right... for single episode (single instance of a game) - we have multiple steps (multiple agent's actions) to reach the main goal - not to let pole fall over.
Lab. OpenAI Gym #32:57
Lab. OpenAI Gym #46:00

Deterministic & Stochastic environments7:28
Rewards4:31
Bellman equation #16:11
Bellman equation #22:57
Resource - code0:21
Lab. Algorithm for deterministic environments #110:23
Lab. Algorithm for deterministic environments #210:45
Lab. Algorithm for deterministic environments #310:34
Lab. Algorithm for deterministic environments #46:24
Lab. Test with stochastic environment4:04
Q-Learning10:08
Lab. Algorithm for stochastic environments6:50
Exploration vs Exploitation2:56
Lab. Egreedy7:34
Lab. Adaptive egreedy5:39
Bonus Lab. Value iteration11:32
Homework4:30
Homework. Solution5:17
Homework. Tuning10:23

Scaling up6:04
Neural Networks review4:00
Lab. Neural Networks review #19:51
Here it's worth mentioning that when we set random seed for our environment - in configuration provided in this video
env.action_space.sample()
will still get fully random (not repeatable) results.
This is due to that fact that for sample moves for some reasons Gym uses different approach to get randomness. More details here:
https://github.com/openai/gym/blob/339415aa03a9b039a51f67798a44f8cd21464091/gym/spaces/box.py#L28-L29
So if you also need to "fix" randomness of sample moves - you have to use:
from gym.spaces.prng import seed
seed(seed_value)
Lab. Neural Networks review #210:08
Lab. Random CartPole6:45
Lab. Epsilon egreedy revisited2:37
Lab. Pytorch updated ( version 0.4.0 )7:35
Article. Pytorch updated! (further versions)0:10
Lab. OpenAI Gym + Neural Network #110:04
Lab. OpenAI Gym + Neural Network #28:53
Lab. OpenAI Gym + Neural Network #34:35
Lab. Extended logging10:18

CNN Review5:53
Lab. Random Pong8:32
File atari_wrappers.py (from OpenAI github page) has to be downloaded to the same location as all other files. It has to be in same directory - because then we will be able to import it directly in our code.
Saving & Loading the Model1:19
Lab. Pong from video output #19:53
Lab. Pong from video output #29:01
Lab. Pong from video output #310:10
Lab. Pong from video output #410:13
Lab. Pong from video output #59:05
Lab. Pong from video output #610:34
Here I also recommend giving a try with update_target_frequency = 2000. I noticed that sometimes it give even better results (game is resolved faster)!
Potential improvements4:09
Article. Stacking 4 images together1:11

Requirements

Basic python knowledge is needed. AI / Machine Learning / Pytorch basics - nice to have but not fully necessary. Only open source tools will be in use.

Description

UPDATE:

All the code and installation instructions have been updated and verified to work with Pytorch 1.6 !!

Artificial Intelligence is dynamically edging its way into our lives. It is already broadly available and we use it - sometimes even not knowing it - on daily basis. Soon it will be our permanent, every day companion.

And where can we place Reinforcement Learning in AI world? Definitely this is one of the most promising and fastest growing technologies that can eventually lead us to General Artificial Intelligence! We can see multiple examples where AI can achieve amazing results - from reaching super human level while playing games to solving real life problems (robotics, healthcare, etc).

Without a doubt it's worth to know and understand it!

And that's why this course has been created.

We will go through multiple topics, focusing on most important and practical details. We will start from very basic information, gradually building our understanding, and finally reaching the point where we will make our agent learn in human-like way - only from video input!

What's important - of course we need to cover some theory - but we will mainly focus on practical part. Goal is to understand WHY and HOW.

In order to evaluate our algorithms we will use environments from - very popular - OpenAI Gym. We will start from basic text games, through more complex ones, up to challenging Atari games

What will be covered during the course ?

- Introduction to Reinforcement Learning

- Markov Decision Process

- Deterministic and stochastic environments

- Bellman Equation

- Q Learning

- Exploration vs Exploitation

- Scaling up

- Neural Networks as function approximators

- Deep Reinforcement Learning

- DQN

- Improvements to DQN

- Learning from video input

- Reproducing some of most popular RL solutions

- Tuning parameters and general recommendations

See you in the class!

Who this course is for:

Anyone interested in artificial intelligence, data science, machine learning, deep learning and reinforcement learning.

Reinforcement Learning with Pytorch

What you'll learn

Explore related topics

Course content

Welcome to the course3 lectures • 4min

Introduction9 lectures • 44min

Tabular methods19 lectures • 2hr 8min

Scaling up12 lectures • 1hr 21min

DQN9 lectures • 59min

DQN Improvements5 lectures • 35min

DQN with video output11 lectures • 1hr 20min

Final notes1 lecture • 2min

Requirements

Description

Who this course is for: