Reinforcement Learning tutorial

By:

This tutorial is organized as follows:

Part 0: Python/NumPy introduction.
Part 1: Tabular Q-learning (gridworld).
Part 2: Double Deep Q-learning (gridworld).
Part 3: Deep Deterministic Policy Gradient (continuous control) (under development).
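
As a rough preview of the kind of update rule Part 1 deals with, here is a minimal sketch of a single tabular Q-learning step in NumPy. It is not taken from the tutorial files: the function name `q_learning_update`, the 4-state, 2-action table, and the values of `alpha` and `gamma` are all assumptions chosen for illustration.

```python
import numpy as np


def q_learning_update(q_table, state, action, reward, next_state, done,
                      alpha=0.1, gamma=0.99):
    """Apply one tabular Q-learning update in place and return the TD error.

    Illustrative helper (hypothetical name and defaults), not the tutorial's code.
    """
    # Bootstrap from the best next-state value unless the episode has ended.
    if done:
        target = reward
    else:
        target = reward + gamma * np.max(q_table[next_state])
    td_error = target - q_table[state, action]
    # Move the current estimate a fraction alpha toward the target.
    q_table[state, action] += alpha * td_error
    return td_error


# Hypothetical toy problem: 4 states, 2 actions, all values initialised to zero.
q_table = np.zeros((4, 2))
q_learning_update(q_table, state=0, action=1, reward=1.0, next_state=1, done=False)
```

Because the table starts at zero, the first update moves `q_table[0, 1]` by `alpha * reward`. In a gridworld, `state` would typically be an integer index for the agent's cell.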

The tutorial files are in the numpy-based folder.
