PMDB

Oct 11, 2022


Model-Based Offline Reinforcement Learning with Pessimism-Modulated Dynamics Belief

Code to reproduce the experiments in Model-Based Offline Reinforcement Learning with Pessimism-Modulated Dynamics Belief.

Installation

Install MuJoCo 2.0.0 to ~/.mujoco/mujoco200.
Create a conda environment and install requirements.

cd PMDB
conda env create -f PMDB_env.yml
conda activate PMDB_env
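
Before creating the environment, it can help to confirm that MuJoCo was unpacked where the first step expects it. The sketch below is not part of the repository. It only checks that the directory exists and is non-empty, and it does not validate the MuJoCo version or licence key. The function name mujoco_dir_present is made up for this illustration.

```python
from pathlib import Path

# Default location taken from the installation step above.
DEFAULT_MUJOCO_DIR = Path.home() / ".mujoco" / "mujoco200"


def mujoco_dir_present(mujoco_dir: Path = DEFAULT_MUJOCO_DIR) -> bool:
    """Return True if mujoco_dir exists and contains at least one entry.

    Hypothetical helper: a filesystem check only, not a full install check.
    """
    return mujoco_dir.is_dir() and any(mujoco_dir.iterdir())
```

Calling mujoco_dir_present() with no arguments checks ~/.mujoco/mujoco200. Pass a different Path if MuJoCo lives elsewhere on your machine.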

Usage

For example, the following command runs the hopper-medium-v2 benchmark from D4RL:

python main.py --task=hopper-medium-v2

Detailed configuration can be found in config.py.
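
To run several benchmarks back to back, the command above can be wrapped in a small driver script. This is a minimal sketch, not something shipped with the repository. It relies only on the --task flag shown above, and the helper names build_commands and run_all are hypothetical. Any task name other than hopper-medium-v2 is an assumption: check config.py to see which tasks are actually supported.

```python
import subprocess
import sys
from typing import List, Sequence


def build_commands(tasks: Sequence[str]) -> List[List[str]]:
    """Build one `python main.py --task=<task>` argument list per task."""
    return [[sys.executable, "main.py", f"--task={task}"] for task in tasks]


def run_all(tasks: Sequence[str], dry_run: bool = True) -> None:
    """Print each command; execute it too when dry_run is False.

    Runs are sequential, and check=True stops at the first failing run.
    Execute from the PMDB directory so that main.py is found.
    """
    for cmd in build_commands(tasks):
        print(" ".join(cmd))
        if not dry_run:
            subprocess.run(cmd, check=True)
```

For instance, run_all(["hopper-medium-v2"], dry_run=False) launches the single run from the example above, while the default dry_run=True only prints the commands.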

Logging

By default, TensorBoard logs are generated in the log/ directory.
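
The logs can be viewed by pointing TensorBoard at that directory (tensorboard --logdir log). To check from a script whether a run has written anything yet, a sketch like the following may be useful. It assumes the logs use TensorBoard's usual event-file naming (events.out.tfevents.*). The helper find_event_files is hypothetical and not part of the repository.

```python
from pathlib import Path
from typing import List


def find_event_files(log_dir: str = "log") -> List[Path]:
    """Return all TensorBoard event files under log_dir, sorted by path.

    Returns an empty list if the directory does not exist yet.
    """
    root = Path(log_dir)
    if not root.is_dir():
        return []
    # rglob searches log_dir itself and every subdirectory.
    return sorted(p for p in root.rglob("events.out.tfevents.*") if p.is_file())
```

An empty result after a run has started suggests the logs are being written somewhere other than log/, in which case config.py is the place to look.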

Citing PMDB

@inproceedings{guo2022pmdb,
  title={Model-Based Offline Reinforcement Learning with Pessimism-Modulated Dynamics Belief},
  author={Kaiyang Guo and Yunfeng Shao and Yanhui Geng},
  booktitle={NeurIPS},
  year={2022}
}
