Skip to content

Implementation of Truncated Quantile Critics method for continuous reinforcement learning. https://bayesgroup.github.io/tqc/

License

Notifications You must be signed in to change notification settings

Lauren-Stumpf/tqc_pytorch

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Controlling Overestimation Bias with Truncated Mixture of ContinuousDistributional Quantile Critics

PyTorch implementation of Truncated Quantile Critics (TQC).

Requirements

Create anaconda environment from provided environment.yaml file:

conda env create -f environment.yml 

It essentially consists of gym==0.12.5, mujoco-py==1.50.1.68, pytorch=1.3.0, torchvision=0.2.1.

Environment contains mujoco-py library which may require to install additional libraries depending on OS.

Usage

Experiments on single environments can be run by calling from created environment:

conda activate tqc
python main.py --env HalfCheetah-v2

Hyper-parameters can be modified with different arguments to main.py.

About

Implementation of Truncated Quantile Critics method for continuous reinforcement learning. https://bayesgroup.github.io/tqc/

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%