This repo records my answers to all questions from the exercises of CS229 (Autumn 2017).
I tried to record all details in Jupyter notebooks. If you see any mistake, please let me know by opening a new issue.
As for reinforcement learning, I've also previously implemented value iteration, policy iteration, SARSA, and Q-learning in JavaScript for the gridworld at with a web demo at
You might also be interested in an earlier version of CS229,
This project is considered complete.