HOW TO RUN THE TRAINING PROCESS

imitation.py

> The learning rate was fixed after our experiments; the final value is 0.001, used with the Adam optimizer.
> To run the model and the training process, run "python imitation.py".
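
To make the learning-rate setting concrete, here is a minimal pure-Python sketch of a single Adam update with a step size of 0.001. It is an illustration only, not the code in imitation.py: the function name `adam_step` is hypothetical, and the values of `beta1`, `beta2` and `eps` are the commonly used Adam defaults, assumed here because the README does not state them.

```python
import math

def adam_step(params, grads, m, v, t, lr=0.001, beta1=0.9, beta2=0.999, eps=1e-8):
    """Return updated (params, m, v) after one Adam step.

    lr=0.001 is the value quoted in this README; beta1, beta2 and eps
    are assumed defaults, not values confirmed by the repository.
    t is the 1-based step counter used for bias correction.
    """
    new_params, new_m, new_v = [], [], []
    for p, g, m_i, v_i in zip(params, grads, m, v):
        m_i = beta1 * m_i + (1 - beta1) * g        # running mean of gradients
        v_i = beta2 * v_i + (1 - beta2) * g * g    # running mean of squared gradients
        m_hat = m_i / (1 - beta1 ** t)             # bias-corrected first moment
        v_hat = v_i / (1 - beta2 ** t)             # bias-corrected second moment
        new_params.append(p - lr * m_hat / (math.sqrt(v_hat) + eps))
        new_m.append(m_i)
        new_v.append(v_i)
    return new_params, new_m, new_v

if __name__ == "__main__":
    params, m, v = adam_step([1.0, -2.0], [0.5, -3.0], [0.0, 0.0], [0.0, 0.0], t=1)
    print(params)
```

On the very first step the bias-corrected ratio is close to the sign of the gradient, so each parameter moves by roughly the learning rate regardless of the gradient's magnitude.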

reinforce.py

> The hyper-parameters are fixed: the learning rate, the discount rate, and the test interval (one test every 500 training epochs).
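
The roles of the discount rate and the test interval can be sketched as follows. This is a minimal illustration under stated assumptions, not the code in reinforce.py: the helper names `discounted_returns` and `should_test` are hypothetical, and the discount value used in the example (0.5) is chosen only to keep the arithmetic easy to follow, since the README does not give the actual value.

```python
def discounted_returns(rewards, gamma):
    """Compute G_t = r_t + gamma * G_{t+1} for every step of one episode."""
    returns = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each return reuses the one that follows it.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns

def should_test(epoch, interval=500):
    """True on every `interval`-th training epoch (500 per this README)."""
    return epoch > 0 and epoch % interval == 0

if __name__ == "__main__":
    # gamma=0.5 is an illustrative value, not the repository's setting.
    print(discounted_returns([1.0, 1.0, 1.0], gamma=0.5))
    print([e for e in range(0, 1501) if should_test(e)])
```

In REINFORCE, each return G_t weights the log-probability of the action taken at step t, so the discount rate controls how strongly late rewards influence early actions.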

a2c.py

> The architecture of the critic model is fixed; the actor model is adapted from the configuration of the expert model in question 1.
> Command-line arguments can be passed via flags.
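
As a rough sketch of how the actor and the critic interact, the snippet below computes the critic's regression targets and the advantages that weight the actor's update. It assumes full-episode returns; the README does not say whether a2c.py uses full returns or N-step bootstrapping, so treat this as one possible variant. The function name `a2c_targets` and the example numbers are hypothetical.

```python
def a2c_targets(rewards, values, gamma):
    """Return (returns, advantages) for one episode.

    rewards: per-step rewards from the environment.
    values:  the critic's estimate V(s_t) for each visited state.
    The returns serve as the critic's targets; the advantages
    G_t - V(s_t) scale the actor's policy-gradient term.
    """
    returns = []
    running = 0.0
    for r in reversed(rewards):
        running = r + gamma * running
        returns.append(running)
    returns.reverse()
    advantages = [g - v for g, v in zip(returns, values)]
    return returns, advantages

if __name__ == "__main__":
    # Illustrative numbers only; gamma=0.5 is not the repository's setting.
    rets, advs = a2c_targets([1.0, 1.0, 1.0], [1.0, 1.0, 1.0], gamma=0.5)
    print(rets, advs)
```

Subtracting the critic's estimate leaves the expected gradient unchanged while reducing its variance, which is the main reason to train a critic alongside the actor.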

VIDEOS

> The "./video" directory contains the videos for questions 1 and 2.
> ./video/video-Q1 contains the video clips for question 1.
> ./video/video-Q2 contains the video clips for question 2.

Folders and files

video/video-Q1
LunarLander-v2-config.json
LunarLander-v2-weights.h5
a2c.py
imitation.py
readme.md
reinforce.py
requirements.txt

LuiCB/LunarLander

Folders and files

Latest commit

History

Repository files navigation

HOW TO RUN THE TRAINING PROCESS

imitation.py

reinforce.py

a2c.py

VIDEOs

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages