Udemy
    •  
    •  
    •  
    •  
    •  
    •  
    •  
    •  
Turn what you know into an opportunity and reach millions around the world.
Learn More
Your cart is empty.
Keep shopping
Contextual Multi-Armed Bandit Problems in Python
Rating: 3.6 out of 5(12 ratings)
102 students

Contextual Multi-Armed Bandit Problems in Python

All you need to master and apply multi-armed bandit problems into real-world problems
Created byHadi Aghazadeh
Last updated 2/2024
English

What you'll learn

  • Master all essential Bandit Algorithms
  • Learn How to Apply Bandit Problems into Real-world Applications with Focus on Product Recommendation
  • Learn How to Implement All Essential Aspects of Bandit Algorithms in Python
  • Build Different Deterministic and Stochastic Environments for Bandit Problems to Simulate Different Scenarios
  • Learn and Apply Bayesian Inference for Bandit Problems and Beyond as a Byproduct of This Course
  • Understand Essential Concepts in Contextual Bandit Problems
  • Apply Contextual Bandit Problems in a Real-World Product Recommendation Dataset and Scenario

Course content

5 sections70 lectures9h 0m total length
  • Course Overview11:30

    Overview of the course to see the big picture of what will happen.

  • Casino and Statistics5:45

    An almost fun approach on how Casinos shaped the curse of Statistics!

  • Story: A Gambler in Casino2:24

    A story as a introduction to the Multi-armed Bandit Problems!

  • Multi-armed Bandit Problems and Their Applications7:55

    Applications of Multi-armed Bandit Problems!

  • Multi-armed Bandit Problems for Startup Founders3:13

    MAB has many applications in online digital section. This video shows how startups take the advantage of MAB for building customized products for their customers!

  • Similarities and Differences between Bandit Problems and Reinforcement Learning6:21

    An important video on the similarities and differences of RL and MAB.

  • Slides0:03

    Slides for the introduction section!

  • Resources0:07

    Resources that this course is based on them.

  • The most important difference between RL and MAB

Requirements

  • No obligational pre-requisites

Description

Welcome to our course where we'll guide you through Multi-armed Bandit Problems and Contextual Bandit Problems, step by step. No prior experience needed - we'll start from scratch and build up your skills so you can use these algorithms for your own projects.


We'll cover the basics like random, greedy, e-greedy, softmax, and more advanced methods like Upper Confidence Bound (UCB). Along the way, we'll explain concepts like Regret concept instead of just focusing on rewards value in Reinforcement Learning and Multi-armed Bandit Problems. Through practical examples in different types of environments, like deterministic, stochastic and non-stationary environment, you'll see how these algorithms perform in action.


Ever wondered how Multi-armed Bandit problems relate to Reinforcement Learning? We'll break it down for you, highlighting what's similar and what's different.

We'll also dive into Bayesian inference, introducing you to Thompson sampling, both for binary reward and real value reward in simple terms, and use Beta and Gaussian distributions to estimate the probability distributions with clear examples to help you understand the theory and how to put it into practice.


Then, we'll explore Contextual Bandit problems, using the LinUCB algorithm as our guide. From basic toy examples to real-world data, you'll see how it works and compare it to simpler methods like e-greedy.


Don't worry if you're new to Python - we've got you covered with a section to help you get started. And to make sure you're really getting it, we'll throw in some quizzes to test your understanding along the way.


Our explanations are clear, our code is clean, and we've added fun visualizations to help everything make sense. So join us on this journey and become a master of Multi-armed and Contextual Bandit Problems!

Who this course is for:

  • Web Application Developers
  • Researchers working on Action optimization
  • Machine Learning Developers and Data Scientists
  • Startup Enthusiasts Driven to Develop Customized Recommendation Apps.