Hindsight Experience Replay

This repository is the Pytorch implementation of Hindsight Experience Replay

Pseudocode of the HER algorithm

Set up

Install dependencies with Docker. You can also install dependencies using requirements.txt.

To build Docker image, run this command.

# format: docker build -t . <image_name>
docker build -t . her

After building image, use the following command to run the Docker container.

docker run -ti --gpus '"device='<gpu number>'"' -v <your working directory>:/app --ipc=host --name <container_name> <image_name> /bin/bash

# or you can run this command after changing docker_run.sh file in proper format
./docker_run.sh <gpu num> <container_name>

Train

If you want to train the agent with HER algorithm in BitFlip environment, run this command inside the Docker container.

python train_bitflip.py --config config_bitflip.yaml

You can freely change the hyperparameter if you needed. I tested n_bits in [10, 25, 45] with same hyperparameters except n_bits / max_episode_steps / hidden_units. The paper mentioned that nbits < 50 can be trained.

Test

You can test with the pretrained networks. All pretrained networks for 10bits/25bits/45bits can be downloaded in this link

To see the bitflip simulation result with the network, run

# before run this command, you should put the path to checkpoint in config file.
python render_bitflip.py --config config_bitflip.yaml

Results

The bitflip simulation results with 10 bits.

The bitflip simulation results with 25 bits.

The bitflip simulation results with 45 bits.

The training logs of DDQN + HER with 10bits.

The training logs of DDQN + HER with 25bits.

The training logs of DDQN + HER with 45bits.

The training logs of vanilla DDQN with 10bits.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
imgs		imgs
Dockerfile		Dockerfile
License		License
README.md		README.md
bitflip.py		bitflip.py
buffer.py		buffer.py
config_bitflip.yaml		config_bitflip.yaml
config_bitflip_vanilla.yaml		config_bitflip_vanilla.yaml
docker_run.sh		docker_run.sh
entrypoint.sh		entrypoint.sh
hindsight_buffer.py		hindsight_buffer.py
networks.py		networks.py
render_bitflip.py		render_bitflip.py
requirements.txt		requirements.txt
train_bitflip.py		train_bitflip.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Hindsight Experience Replay

Pseudocode of the HER algorithm

Set up

Train

Test

Results

Acknowlegement

About

Releases 2

Packages

Languages

License

HAN-oQo/Hindsight_Experience_Replay

Folders and files

Latest commit

History

Repository files navigation

Hindsight Experience Replay

Pseudocode of the HER algorithm

Set up

Train

Test

Results

Acknowlegement

About

Resources

License

Stars

Watchers

Forks

Releases 2

Packages 0

Languages

Packages