Skip to content

Files

Latest commit

Oct 11, 2022
9e1f95e · Oct 11, 2022

History

History
This branch is 106 commits behind huawei-noah/HEBO:master.

PMDB

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
Oct 11, 2022
Oct 11, 2022
Oct 11, 2022
Oct 11, 2022
Oct 11, 2022
Oct 11, 2022
Oct 11, 2022

Model-Based Offline Reinforcement Learning with Pessimism-Modulated Dynamics Belief

Code to reproduce the experiments in Model-Based Offline Reinforcement Learning with Pessimism-Modulated Dynamics Belief.

Installation

  1. Install MuJoCo 2.0.0 to ~/.mujoco/mujoco200.
  2. Create a conda environment and install requirements.
cd PMDB
conda env create -f PMDB_env.yml
conda activate PMDB_env

Usage

For example, use the following command to run Hopper-medium-v2 benchmark in D4RL.

python main.py --task=hopper-medium-v2

Detailed configuration can be found in config.py.

Logging

By default, TensorBoard logs are generated in the log/ directory.

Citing PMDB

@inproceedings{guo2022pmdb,
  title={Model-Based Offline Reinforcement Learning with Pessimism-Modulated Dynamics Belief},
  author={Kaiyang Guo and Yunfeng Shao and Yanhui Geng},
  booktitle{NeurIPS},
  year={2022}
}