- 斯坦福 cs234 强化学习中文讲义
- Lecture 1 Introduction to Reinforcement Learning
- Lecture 3 Model Free Policy Evaluation: Policy Evaluation Without Knowing How the World Works
- Lecture 4 Model Free Control
- Lecture 5 Value Function Approximation
- Lecture 6 CNNs and Deep Q-learning
- Lecture 7 Imitation Learning
- Lecture 8&9 Policy Gradient
- Lecture 10 Advanced Policy Gradient
- Lecture 11&12 Exploration and Exploitation
- Lecture 14 Model Based RL, Monte-Carlo Tree Search