PPO implementation in PyTorch, based on code from PyTorch PPO.
- README.md
- main.py
- mymodels.py
- plottheresults.py
- ./a2c_ppo_acktr/arguments.py
- ./plots/rewardbp.py
- PyTorch
- TensorFlow
- OpenAI Gym
- pybullet
python main.py --env-name 'BipedalWalkerHardcore-v2' --use_tdm True --beta_int 10.0 --num_layers 2 --fc_width 300 --opt_lr 1e-4 --beta_schedule linear --bonus_func log
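
The flags in the command above are presumably defined in ./a2c_ppo_acktr/arguments.py. As a rough illustration only, the sketch below shows one way such flags could be declared with `argparse`. The helper `str2bool`, the function `build_parser`, and all default values are hypothetical and not taken from the repository. One point worth noting for a flag passed as `--use_tdm True`: a plain `type=bool` would treat any non-empty string (including `False`) as true, so the sketch uses an explicit converter.

```python
import argparse


def str2bool(value: str) -> bool:
    """Convert 'True'/'False'-style strings to bool (hypothetical helper)."""
    lowered = value.lower()
    if lowered in ("true", "1", "yes"):
        return True
    if lowered in ("false", "0", "no"):
        return False
    raise argparse.ArgumentTypeError(f"expected a boolean, got {value!r}")


def build_parser() -> argparse.ArgumentParser:
    """Declare the flags used in the example command.

    Flag names come from the command line above; every default here is an
    assumption made for illustration, not the repository's actual value.
    """
    parser = argparse.ArgumentParser(description="illustrative flag parser")
    parser.add_argument("--env-name", type=str, required=True)
    parser.add_argument("--use_tdm", type=str2bool, default=False)
    parser.add_argument("--beta_int", type=float, default=1.0)
    parser.add_argument("--num_layers", type=int, default=2)
    parser.add_argument("--fc_width", type=int, default=300)
    parser.add_argument("--opt_lr", type=float, default=1e-4)
    parser.add_argument("--beta_schedule", type=str, default="linear")
    parser.add_argument("--bonus_func", type=str, default="log")
    return parser


# Same arguments as the command above, split the way a shell would pass them.
example_argv = [
    "--env-name", "BipedalWalkerHardcore-v2",
    "--use_tdm", "True",
    "--beta_int", "10.0",
    "--num_layers", "2",
    "--fc_width", "300",
    "--opt_lr", "1e-4",
    "--beta_schedule", "linear",
    "--bonus_func", "log",
]
args = build_parser().parse_args(example_argv)
print(vars(args))
```

Note that `argparse` exposes `--env-name` as `args.env_name`, replacing the hyphen with an underscore.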