Reinforcement Learning tutorial

By:
- Juan Pablo Martínez Piazuelo
- Daniel Esteban Ochoa Tamayo

This tutorial is organized as follows:
- Tabular Q-learning (gridworld)
- Double Deep Q-learning (gridworld)
- Deep Deterministic Policy Gradient (continuous control) (under development)

The tutorial files are in the numpy-based folder.

Launch tutorial:
- With Binder (recommended):
- With Azure Notebooks: