
Learn reinforcement learning by building eight ai agents with python, from q-learning basics to deep q-networks, tackling Space Invaders, stock trading, and road tasks.
Set up gym environment in Jupiter and load Space Invaders with RAM or pixel mode. Run episodes with random actions to observe state and rewards for convolutional neural network learning.
Build a reinforcement learning neural network with a TensorFlow sequential model, conv2d and dense layers, using adam optimizer and input shaped height by width by channels to output six actions.
Build a deep Q network agent for reinforcement learning by configuring memory and policies. Use epsilon-greedy action selection and a dueling network with a linear policy.
Test the untrained agent in the environment and visualize performance to observe reinforcement learning progress. Then train more, add convolution layers, adjust learning rate, and save or load the model.
Analyze the trained agent's performance by calculating and printing the average reward per thousand episodes, track rewards over time, and observe learning progression as the model improves.
Set up a Flappy RL project folder, install the Flappy Bird Gym RAM representation, configure Visual Studio Code, and prepare to build a deep Q network reinforcement learning agent.
Import dependencies for the Flappy Bird RL agent, including random, numpy as np, gym, deque memory, keras layers and models, and TensorFlow setup, then build a deep q-network class.
Create the agent class as a blueprint brain with environment, observation and action spaces, memory, gamma, epsilon decay, epsilon min, and batch size.
Implement the learned function by sampling a mini-batch from memory, building state and next state arrays, and applying deep Q-learning with a target network to train and save the model.
Implement a save function to store the model when the score exceeds a thousand, naming it 'flappy brain'. Visualize how the reinforcement learning agent learns to jump.
Import dependencies for mario bot, including TensorFlow, numpy, a deque memory buffer, and a convolutional neural network with dense, activation, flatten, conv2d, and pooling layers for learning from screen pixels.
Construct the DeQuan agent class for a Mario agent, define state and action spaces, initialize memory (maxlen 5000) and epsilon-greedy parameters to reduce oscillation via main and target networks.
Build a convolutional neural network for a deep q network using a sequential model with conv layers, relu, flattening, dense layers, mapping to the action space, main and target networks.
Implement an update epsilon function that decays over episodes, then build a training function using mini-batches to predict targets and fit the model, with a Flappy Bird example.
Define a pre_process_state function that converts 240x256x3 frames to 88x80x1 grayscale arrays, producing a compact input for reinforcement learning agents.
Configure a deep Q network training loop with million episodes, 400000 timesteps, and 64 batch size, preprocessing states to 80x88x1, train on memory after warm-up, update epsilon and target network.
Create a better learning environment for the agent by gating actions to occur only when the y position shows it is on the ground and using a stuck buffer.
Implement save and load functions to persist the main and target networks’ weights, enabling ongoing training and later visualization, with cloud GPU access for training in the cloud.
Visualize the agent by predicting an action with no epsilon, using a neural network to compute q-values and select the best action.
Install the gym any trading environment and stable baselines, create a project directory, and launch a notebook to build a stock trading agent using S&P 500 with buy and sell.
Import essential packages for reinforcement learning, including gym and a trading package. Use stable baselines with A to C, and numpy, pandas, and matplotlib; next, import data from Yahoo Finance.
Acquire S&P 500 data from Yahoo Finance and MarketWatch, read and prepare a CSFI file in Python using pandas to train a reinforcement learning model on historical trading patterns.
Preprocess MarketWatch data for a trading environment by converting dates to datetime, sorting in ascending order, and setting the date as the index.
Prepare the stock data by casting columns to float and removing commas, then create a stocks-v0 environment with frame_bound five to two hundred and a window size for buy/sell signals.
Reset the state, take random actions in the environment, step through the data, and visualize the random trading actions with red signals for short and green signals for long.
Create a custom reinforcement learning environment by selecting a subset of 89 technical indicators, building a gym trading environment, and processing data features for stable agent learning.
Create a custom environment with technical indicators, train a policy-based reinforcement learning model for stock data, and visualize profits to evaluate performance before moving to a live market gym environment.
Import and install key packages like gym, highway, numpy, and stable baselines to enable reinforcement learning with ready-made algorithms; in the next video, build the environment and model.
Set up the roundabout gym environment and build a PPO MLP policy to train an agent, then run random action tests and prepare for learning with stable baselines.
Train a PPO-based model with an MLP policy, wrap the environment, and monitor entropy and losses; save, load with stable baselines, then visualize the agent through rendered episodes.
Create an intelligent agent that learns to park by setting up a parking gym environment and analyzing random actions to understand performance before training.
Visualize the parking agent with the same prediction code, train it to exhibit intelligent behavior, then proceed to merging onto the highway in the next lesson.
Train a highway merging reinforcement learning agent in OpenAI Gym. Compare a deep cue network with an actor-critic model using the p o algorithm and stable baselines.
Join the most comprehensive Reinforcement Learning course on Udemy and learn how to build Amazing Reinforcement Learning Applications!
Do you want to learn how to build cutting edge trading algorithms that leverage todays technology? Or do you want to learn the tools and skills that are considered the state of the art of Artificial Intelligence? Or do you just want to learn Reinforcement Learning in a Highly practical way?
After completing this course you will be able to:
Build any reinforcement learning algorithm in any environment
Use Reinforcement Learning for your own scientific experiments
Solve problems using Reinforcement Learning
Leverage Cutting Edge Technologies for your own project
Master OpenAI gym's
Why should you choose this course?
This course guides you through a step-by-step process of building state of the art trading algorithms and ensures that you walk away with the practical skills to build any reinforcement learning algorithm idea you have and implement it efficiently.
Here's what's included in the course:
Atari Reinforcement Learning Agent
Build Q-Learning from scratch and implement it in Autonomous Taxi Environment
Build Deep Q-Learning from scratch and implement it in Flappy Bird
Build Deep Q-Learning from scratch and implement it in Mario
Build a Stock Reinforcement Learning Algorithm
Build a intelligent car that can complete various environments
And much more!
This course is for you if ...
You're interested in cutting edge technology and applying it in practical ways
You're passionate about Deep Learning/AI
Want to learn about cutting-edge technologies!
Want to learn reinforcement learning by doing cool projects!
Course prerequisites:
Python!