Practical_RL/week9_policy_II at master · learcane/Practical_RL

Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
seminar_TRPO_pytorch.ipynb		seminar_TRPO_pytorch.ipynb
seminar_TRPO_tensorflow.ipynb		seminar_TRPO_tensorflow.ipynb
seminar_TRPO_theano.ipynb		seminar_TRPO_theano.ipynb

README.md

This section covers some steroids for policy gradient methods, along with a cool general trick called

Go to seminar_TRPO_<framework>.ipynb and follow instructions in the notebook.

While you already know algorithms that will work with continuously many actions, it can't hurt to learn something more specialized.