MinkyungPark/PORelDICE

 
 

PORelDICE

This is the JAX implementation of our paper, Relaxed Stationary Distribution Correction Estimation for Improved Offline Policy Optimization.

This codebase builds on the IQL and SQL repositories.

Installation

$ conda create -c nvidia -n PORelDICE python=3.8 cuda-nvcc=11.3
$ conda activate PORelDICE
$ pip install -r requirements.txt --no-deps
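
Because the requirements are installed with --no-deps, it can be worth confirming that JAX imports and sees the expected device before launching experiments. The following is a minimal sketch of such a check, not part of this repository. The function name `describe_jax_backend` is hypothetical, and it assumes the installed JAX version provides `jax.default_backend()` and `jax.devices()`.

```python
"""Post-install sanity check (a sketch, not part of the PORelDICE repository)."""


def describe_jax_backend():
    """Return a small report on whether JAX imports and which devices it sees."""
    try:
        import jax  # imported lazily so the check still runs if JAX is missing
    except ImportError as err:
        return {"installed": False, "error": str(err)}
    return {
        "installed": True,
        "version": jax.__version__,
        "backend": jax.default_backend(),  # e.g. "cpu" or "gpu"
        "devices": [str(d) for d in jax.devices()],
    }


if __name__ == "__main__":
    # Print one line per field so a missing GPU backend is easy to spot.
    for key, value in describe_jax_backend().items():
        print(f"{key}: {value}")
```

If the reported backend is "cpu" on a GPU machine, the installed jaxlib build probably does not match the CUDA toolkit from the conda environment.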

Run Experiments

Mujoco

$ ./run_mujoco.sh

Antmaze

$ ./run_antmaze.sh

Kitchen

$ ./run_kitchen.sh
