Improving long-horizon decision making with hierarchical goal-conditioned planning

Final Project for UT Reinforcement Learning course (Fall 2019)

Kai-Chi Huang, Wei-Jen Ko

Youtube video: https://youtu.be/c_G16ep3f-I

Report PDF: https://github.com/kevin00036/ut-rl2019/raw/master/RL_Final_Project.pdf

Dependencies:

Python 3.7+
gym (0.15.4)
numpy (1.17.4)
torch (1.3.1 with CUDA)
A machine with a CUDA-compatible GPU

To run codes:

Simply run

python3 test.py

To change environments, change (or uncomment) the environment env on Line 17 in test.py. Note that we currently support discrete-action environments currently.

To switch the TD3 optimization (mitigating maximization bias), toggle the use_td3 variable on Line 24.

To switch between Goal-conditioned RL and standard RL, uncomment the corresponding agent on Line 27-28.

To change the maximum environment steps, change the max_steps variable on Line 40.

The execution log will be save at <project base>/logs/<algorithm_name>/xxxxxxxxxx_yyyyy.json

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
.gitignore		.gitignore
README.md		README.md
RL_Final_Project.pdf		RL_Final_Project.pdf
ddpg.py		ddpg.py
dqn.py		dqn.py
dqn_r.py		dqn_r.py
gymtool.py		gymtool.py
planning.py		planning.py
plot.py		plot.py
replay_buffer.py		replay_buffer.py
rl.py		rl.py
stat_logger.py		stat_logger.py
stats.py		stats.py
test.py		test.py
uvfa.py		uvfa.py
uvfa_r.py		uvfa_r.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Improving long-horizon decision making with hierarchical goal-conditioned planning

Final Project for UT Reinforcement Learning course (Fall 2019)

Youtube video: https://youtu.be/c_G16ep3f-I

Report PDF: https://github.com/kevin00036/ut-rl2019/raw/master/RL_Final_Project.pdf

About

Releases

Packages

Languages

kevin00036/ut-rl2019

Folders and files

Latest commit

History

Repository files navigation

Improving long-horizon decision making with hierarchical goal-conditioned planning

Final Project for UT Reinforcement Learning course (Fall 2019)

Youtube video: https://youtu.be/c_G16ep3f-I

Report PDF: https://github.com/kevin00036/ut-rl2019/raw/master/RL_Final_Project.pdf

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages