There are three folders and eight other files
levels
contains the code for all levelsrl
contains an implementation of the Proximal Policy Optimization (PPO) RL algorithmutils
contains files for reading and saving logs, demos and models. In this folder,agent.py
defines an abstract class for an agent
arguments.py
contains the value of default arguments shared by both imitation and reinforcement learningbot.py
is a heuristic stack-based bot that can solve all levelsefficiency.py
contains hyperparmeter configurations we use for imitation learning sample efficiencyevaluate.py
contains functions used by IL and RL to evaluate an agentimitation.py
is our imitation learning implementationmodel.py
contains the neural network codeplotting.py
is used in plotting. It also contains Gaussian Process code used in measuring imitation learning sample efficiency