Skip to content

Pytorch implementation of PPO-Lagrangian, compared against PPO in a continous action Cart Pole environment.

License

Notifications You must be signed in to change notification settings

HaozheTian/Torch-PPO-Lagrangian

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PPO-Lagrangian

This repository provides PyTorch implementations for PPO [Schulman et al, 2017] and PPO-Lagrangian [Ray et al, 2019]. An adaptation of the gym Cart Pole environment with continuous action space is also implemented.

Background

Pseudocode

Requirements

This repository has been tested on Ubuntu 22.04.

gymnasium=0.29.1
matplotlib=3.8.3
numpy=1.26.4
tensorboard=2.16.2
torch=2.2.0
tqdm=4.66.2

Result

Comparison

About

Pytorch implementation of PPO-Lagrangian, compared against PPO in a continous action Cart Pole environment.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published