Skip to content

Latest commit

 

History

History
 
 

06. Case Study: The MAB Problem

6. Case Study: The MAB Problem

  • 6.1. The MAB Problem
  • 6.2. Creating Bandit in the Gym
  • 6.3. Epsilon-Greedy
  • 6.4. Implementing Epsilon-Greedy
  • 6.5. Softmax Exploration
  • 6.6. Implementing Softmax Exploration
  • 6.7. Upper Confidence Bound
  • 6.8. Implementing UCB
  • 6.9. Thompson Sampling
  • 6.10. Implementing Thompson Sampling
  • 6.11. Applications of MAB
  • 6.12. Finding the Best Advertisement Banner using Bandits
  • 6.13. Contextual Bandits