- 6.1. The MAB Problem
- 6.2. Creating a Bandit in the Gym
- 6.3. Epsilon-Greedy
- 6.4. Implementing Epsilon-Greedy
- 6.5. Softmax Exploration
- 6.6. Implementing Softmax Exploration
- 6.7. Upper Confidence Bound
- 6.8. Implementing UCB
- 6.9. Thompson Sampling
- 6.10. Implementing Thompson Sampling
- 6.11. Applications of MAB
- 6.12. Finding the Best Advertisement Banner using Bandits
- 6.13. Contextual Bandits
06. Case Study: The MAB Problem