Skip to content

Tutorial with basic Q-learning and policy gradient

Notifications You must be signed in to change notification settings

dosovits/RL_tutorial

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

RL tutorial

Tutorial with basic Q-learning and policy gradient

Dependencies

  • Python 3.6 (other python3 will work almost certainly, python2 - perhaps after some adjustments)
  • numpy
  • gym (basic version)
  • pytorch 0.4.0

These can be conveniently installed with conda in a conda environment, the basic version of gym is pip-installable.

Acknowledgements

This code is partially based on the tutorial of Arthur Juliani.

About

Tutorial with basic Q-learning and policy gradient

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages