Self-driving go-kart with Unity-ML

Name: Self-driving go-kart with Unity-ML
Rating: 4.5 (275 reviews)

Deep learning applied to a self-driving car simulation

Created byFabrizio Frigeni

Last updated 1/2019

English

What you'll learn

Configure and use the Unity Machine Learning Agents toolkit to solve physical problems in simulated environments
Understand the concepts of neural networks, supervised and deep reinforcement learning (PPO)
Apply ML control techniques to teach a go-kart to drive around a track in Unity

Course content

7 sections • 42 lectures • 1h 48m total length

Introduction0:52
What is the class about?
Table of Content1:16
TOC: basics of self-driving cars, PID controller, imitation learning, reinforcement learning, Unity ML
Prerequisites0:52
Basic math and programming skills are required to follow the class.
Unity is the tool we use for simulations, so it helps to go through the free online tutorials in case you are not familiar with it yet: https://unity3d.com/learn/tutorials

Self-driving go-kart project in Unity3:44
Download and explore the template project.
[last saved with Unity version: 2018.2.13]
Machine Learning Brains3:06
Understand what all the different kinds of brains are used for
Control Scripts2:16
Explore the relevant scripts used in the project
Setup the ML-Agents Toolkit2:43
All updated details for setting up the environment can be found here: https://github.com/Unity-Technologies/ml-agents
Specifically, installation: https://github.com/Unity-Technologies/ml-agents/blob/master/docs/Installation.md
and basic guide: https://github.com/Unity-Technologies/ml-agents/blob/master/docs/Basic-Guide.md
TensorFlowSharp plugin: https://s3.amazonaws.com/unity-ml-agents/0.5/TFSharpPlugin.unitypackage

Traditional Control1:19
Let's start from the easiest control method: hard-coded rules
PID Controller3:32
Introducing the generic PID controller
Tuning1:40
Fine-tuning the parameters
Heuristic brain in Unity2:33
How to implement the PID controller in Unity ML
Cross-track error2:48
How to calculate the cross-track error of the go-kart
Testing the PID2:16
Let the PID control the go-kart
Improvements2:23
Collection of tips to improve on the vanilla PID design
Model-based control3:28
Why adding a prediction model helps
Onto Machine Learning1:45
Limitations of traditional control

Why Machine Learning1:57
Why do we need machine learning in some cases
What kinds of learning2:26
Supervised vs. Reinforcement Learning
Neural Networks3:57
Basic introduction to neural networks
NN Details1:56
A bit deeper in the details of neural networks
Training a NN2:56
How do you train a neural network
Optimizer5:09
Gradient descent techniques
Convolutional layers3:34
Introducing convolutional layers
Transfer learning2:09
Shortcutting the training process.
Some pre-trained models can be found here for Keras: https://keras.io/applications/
Imitation learning in Unity3:27
Configure the teacher and student brains
Training the go-kart via IL1:30
Show the go-kart how to race!
Testing the drive1:49
See if the go-kart has learned well
Tips on imitation learning2:52
A few tips for best performance

Reinforcement Learning2:16
Why self-learning is important
Nomenclature2:17
Environment, agents, observations, actions, reward
Initial state2:28
Starting from scratch vs. pre-injecting knowledge.
NOTE: The current version of the Unity ML agents only offers the option to start from empty brains, because the graphs saved after imitation learning is not compatible with that used in reinforcement learning. This should change in the future: stay updated through the official documentation!
Training a policy1:45
Different ways a policy can be trained.
For a combined approach between perturbing actions space and parameters space see this:
https://blog.openai.com/better-exploration-with-parameter-noise/
The PPO algorithm5:13
Proximal Policy Optimization:
https://blog.openai.com/openai-baselines-ppo/
https://arxiv.org/abs/1707.06347
More in general on policy gradients: http://www.scholarpedia.org/article/Policy_gradient_methods
Evolutional Strategies3:51
Detour on genetic algorithms, in particular evolution strategies:
https://blog.openai.com/evolution-strategies/
https://arxiv.org/abs/1703.03864
Reward2:14
Crafting a reward function
Training the go-kart with RL2:12
Let the go-kart drive on its own...
Tensorboard analysis1:49
Using Tensorboard for detailed analysis of the training results
Testing results0:55
See what the go-kart was able to learn on its own!
Final tips4:11
A few ways to improve training further

Requirements

Basic algebra and basic programming skills

Description

WARNING: take this class as a gentle introduction to machine learning, with particular focus on machine vision and reinforcement learning. The Unity project provided in this course is now obsolete because the Unity ML agents library is still in its beta version and the interface keeps changing all the time! Some of the implementation details you will find in this course will look different if you are using the latest release, but the key concepts and the background theory are still valid. Please refer to the official migrating documentation on the ml-agents github for the latest updates.

Learn how to combine the beauty of Unity with the power of Tensorflow to solve physical problems in a simulated environment with state-of-the-art machine learning techniques.

We study the problem of a go-kart racing around a simple track and try three different approaches to control it: a simple PID controller; a neural network trained via imitation (supervised) learning; and a neural network trained via deep reinforcement learning.

Each technique has its strengths and weaknesses, which we first show in a theoretical way at simple conceptual level, and then apply in a practical way. In all three cases the go-kart will be able to complete a lap without crashing.

We provide the Unity template and the files for all three solutions. Then see if you can build on it and improve performance further more.

Buckle up and have fun!

Who this course is for:

Students interested in a quick jump into machine learning, focusing on the application rather than the theory
Engineers looking for a machine learning realistic simulator

Self-driving go-kart with Unity-ML

What you'll learn

Explore related topics

Course content

Introduction3 lectures • 3min

Self-driving cars1 lecture • 5min

Unity Machine Learning Agents4 lectures • 12min

Traditional Control9 lectures • 22min

Imitation Learning12 lectures • 34min

Reinforcement Learning11 lectures • 29min

Conclusion2 lectures • 4min

Requirements

Description

Who this course is for: