This is the implementation for our paper Relaxed Stationary Distribution Correction Estimation for Improved Offline Policy Optimization in Jax.
This codebase is built upon IQL and SQL repository.
$ conda create -c nvidia -n PORelDICE python=3.8 cuda-nvcc=11.3
$ conda activate PORelDICE
$ pip install -r requirements.txt --no-deps
Mujoco
$ ./run_mujoco.sh
Antmaze
$ ./run_antmaze.sh
Kitchen
$ ./run_kitchen.sh