You can run this code on your own machine or on Google Colab.
- Local option: If you choose to run locally, you will need to install MuJoCo and some Python packages; see installation.md from homework 1 for instructions. There are two new package requirements (`opencv-python` and `gym[atari]`) beyond what was used in the previous assignments; make sure to install these with `pip install -r requirements.txt` if you are running the assignment locally.
- Colab: The first few sections of the notebook will install all required dependencies. You can try out the Colab option by clicking the badges below:
The following files have blanks to be filled with your solutions from homework 1 and 3. The relevant sections are marked with `TODO: get this from Piazza`.
- infrastructure/utils.py
- infrastructure/rl_trainer.py
- policies/MLP_policy.py
- policies/argmax_policy.py
- critics/dqn_critic.py
You will then need to implement code in the following files:
The relevant sections are marked with `TODO`.
You may also want to look through scripts/run_hw5_expl.py (if running locally) or run_hw5_expl.ipynb (if running on Colab), though you will not need to edit these files beyond changing runtime arguments in the Colab notebook.
See the assignment PDF for more details on what files to edit.
For this particular assignment, you will need to install `networkx==2.5`, for example with `pip install networkx==2.5`.
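
If you are running locally, a quick check of the installed packages can save a failed run later. The following is a minimal sketch, not part of the assignment code: the `REQUIRED` table and the `check_requirements` helper are hypothetical names introduced here, and the only version pin it knows about is the `networkx==2.5` requirement stated above. It assumes `gym[atari]` is provided by the `gym` distribution and uses only the standard library (`importlib.metadata`, Python 3.8+).

```python
from importlib.metadata import PackageNotFoundError, version

# Distribution names taken from the setup notes above.
# None means "any installed version is accepted" (an assumption of this sketch).
REQUIRED = {
    "networkx": "2.5",
    "opencv-python": None,
    "gym": None,  # the [atari] extra is installed as part of the gym distribution
}


def check_requirements(required):
    """Return {name: status}; status is 'ok', 'missing', or a version mismatch note."""
    report = {}
    for name, wanted in required.items():
        try:
            found = version(name)
        except PackageNotFoundError:
            report[name] = "missing"
            continue
        if wanted is not None and found != wanted:
            report[name] = f"wrong version (found {found}, expected {wanted})"
        else:
            report[name] = "ok"
    return report


if __name__ == "__main__":
    for name, status in check_requirements(REQUIRED).items():
        print(f"{name}: {status}")
```

Any package reported as `missing` or with a wrong version can be installed with `pip` as described above.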