
This course includes our updated coding exercises so you can practice your skills as you learn.
See a demo
The Bellman equation defines q(s,a) as the immediate reward plus the discounted maximum q-value of the next state over all possible actions.
Define reinforcement learning problems by specifying environment, states, agent, actions, and a reward function; update Q values via the equation, illustrated by navigation grid and job scheduling.
Maze-based reinforcement learning project on a 6x6 grid features a red circle agent that uses four directions to reach the goal, with rewards +1 for goal and -1 for holes.
Create a reinforcement learning project for a maze problem using a pie chart editor, choosing Q-learning, Python 3.6, and creating a folder in My Documents.
Create a maze grid by building a canvas, drawing vertical and horizontal lines, defining the origin, and placing the start, walls, goal circle, and a red explorer rectangle.
Test and debug the build_maze method by using the main block and main loop to run the application window, ensuring the function executes correctly and the program starts.
Implement the choose action function with an epsilon-greedy strategy to balance exploration and exploitation, and select either a random action or the best state-action.
Build a stock trading agent with a deep Q network, using a sliding window of closing prices to define states and actions like buy, sell, or hold, aiming for profit.
Create training app guides implementing a reinforcement learning training loop by configuring window size and data, running episodes, learning from market interactions, and saving the trained model.
Explore reinforcement learning fundamentals, where agents learn from rewards through interaction with an unknown environment, and see how deep learning enhances large state-action spaces with prioritized experience replay.
In this course we learn the concepts and fundamentals of reinforcement learning, it's relation to artificial intelligence and machine learning, and how we can formulate a problem in the context of reinforcement learning and Markov Decision Process. We cover different fundamental algorithms including Q-Learning, SARSA as well as Deep Q-Learning. We present the whole implementation of two projects from scratch with Q-learning and Deep Q-Network.