
In this section, I introduce you to machine learning. We define the term and we learn about the three different types of machine learning: supervised, unsupervised and reinforcement learning.
In this section, I introduce you to reinforcement learning, including a history of the field. You will learn the meaning of 5 key terms: agent, environment, state, actions and reward.
In this section, I introduce you to the NChain game. Our challenge is to create an agent intelligent enough to solve this game.
For Windows users, I show you how to use Google Colab to run your code in the cloud.
In this section, I show Mac and Linux users how to install Python 3 and Sublime, as well as how to use the Terminal. If you are using Windows, use Google Colab instead.
In this section, I show Mac and Linux users how to instal the Open AI gym library. The library is not compatible with Windows, so if you use Windows use Google Colab instead.
We get familiar with the NChain game, by creating an agent which randomly explores..
We refactor our code into a class, ready for the next section.
We give the agent a memory using a reward table, which maps the cumulative reward for each state-action combination.
We create an agent, which chooses the highest reward. We find it gets locked into a sub-optimal strategy. It is not intelligent enough to accept delayed gratification to gain the long-term reward. Building this low intelligence agent helps us understand the challenge we have set ourselves, and leads into a better approach, using Q Learning. By contrasting the two approaches we can understand why Q Learning is so revolutionary.
In this section, I give you further resources if you'd like to learn more.
This course is designed for beginners to machine learning. Some of the most exciting advances in artificial intelligence have occurred by challenging neural networks to play games. I will introduce the concept of reinforcement learning, by teaching you to code a neural network in Python capable of delayed gratification.
We will use the NChain game provided by the Open AI institute. The computer gets a small reward if it goes backwards, but if it learns to make short term sacrifices by persistently pressing forwards it can earn a much larger reward. Using this example I will teach you Deep Q Learning - a revolutionary technique invented by Google DeepMind to teach neural networks to play chess, Go and Atari.