This set of python files represents a small RL library. It's main point is simplicity - the amount of code should be minimal. It's purpose is to implement algorithms for solving the Udacity RL Nanodegree projects.
This library allows storing results in the Google Cloud Platform. This is beneficial, when we want to use multiple VMs for training, or want to run training for a long period of time.
install.sh
will download all the environements. Python libraries from the Udacity RL project should be installed beforehand. If you want to get other trained checkpoints from my storage bucket, you will also need the gsutils tool for getting files from Google Cloud Platform.
Training is done through iterations. Number of iterations is defined via the iterations
parameter.
Each iteration consists of 2 phases:
- Training. It runs at least
steps
steps; - Evaluation. Runs at least
eval_steps
. During the evaluation phase network is not trained, replay buffer is not updated and the noisy layers don't change their noise parameters. Only the reward is calculated.
For getting a full list of training parameters run python train.py
.
Trained models are stored in the checkpoins
folder and TensorBoard statistics is written into the train
folder.