Deep Reinforcement Learning for Vision-Based Robotic Grasping: A Simulated Comparative Evaluation of Off-Policy Methods
This codebase implements the learning algorithms and experiments from *Deep Reinforcement Learning for Vision-Based Robotic Grasping: A Simulated Comparative Evaluation of Off-Policy Methods* (ICRA 2018).
If you use this codebase for your research, please cite the paper:
```
@article{quillen2018deep,
  title={Deep Reinforcement Learning for Vision-Based Robotic Grasping: A Simulated Comparative Evaluation of Off-Policy Methods},
  author={Quillen, Deirdre and Jang, Eric and Nachum, Ofir and Finn, Chelsea and Ibarz, Julian and Levine, Sergey},
  journal={IEEE International Conference on Robotics and Automation},
  year={2018}
}
```
- Several simulated grasping environments with varying degrees of difficulty.
- Customizable implementations of the DQL, MC, Supervised, Corr-MC, DDPG, and PCL algorithms.
- MC returns and eligibility traces for biased, lower-variance return estimates (see the sketch after this list).
- Bash scripts for gathering data from random policies and running synchronous on-policy or off-policy experiments that alternate between training and evaluation.
- Scripts to run grid search over hyperparameters.
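
For context, mixing Monte Carlo returns into the bootstrapped Q-learning target interpolates between an unbiased, high-variance estimate and a biased, lower-variance one. The NumPy sketch below shows one way such a mixed target can be computed; `mixed_mc_target` and the `mc_mixing` coefficient are illustrative names, not this codebase's actual API.

```python
import numpy as np

def mixed_mc_target(rewards, q_next_max, gamma=0.9, mc_mixing=0.5):
    """Blend full Monte Carlo returns with 1-step bootstrapped targets.

    Args:
      rewards: per-step rewards for one episode, shape [T].
      q_next_max: max_a Q(s_{t+1}, a) per step, shape [T]; the final
        entry should be 0 for a terminal state.
      gamma: discount factor.
      mc_mixing: interpolation weight; 0 = pure 1-step Q-learning,
        1 = pure Monte Carlo.

    Returns:
      Regression targets for Q(s_t, a_t), shape [T].
    """
    rewards = np.asarray(rewards, dtype=np.float64)
    q_next_max = np.asarray(q_next_max, dtype=np.float64)
    # Monte Carlo return: discounted sum of rewards until episode end.
    mc_returns = np.zeros_like(rewards)
    running = 0.0
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        mc_returns[t] = running
    # 1-step temporal-difference target (biased, lower variance).
    td_targets = rewards + gamma * q_next_max
    return mc_mixing * mc_returns + (1.0 - mc_mixing) * td_targets
```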
The recommended way to set up these experiments is via a virtualenv:
```bash
sudo apt-get install python-pip
python -m pip install --user virtualenv
python -m virtualenv ~/env
source ~/env/bin/activate
```
Then install the project dependencies in that virtualenv:
```bash
pip install -r dql_grasping/requirements.txt
```
The first step is to collect off-policy grasping data with a random policy:
```bash
sh dql_grasping/run_random_collect_oss.sh
```
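
Under the hood, random collection rolls out a uniform-random policy and logs each transition. A minimal sketch of that loop is below, assuming a classic gym-style environment and a hypothetical `record_transition` callable; neither is the repository's actual interface.

```python
def collect_random_episodes(env, num_episodes, record_transition):
    """Roll out a uniform-random policy and persist each transition.

    `env` is assumed to follow the classic gym API (reset/step), and
    `record_transition` is a hypothetical callable standing in for
    whatever storage format the pipeline uses.
    """
    for _ in range(num_episodes):
        obs = env.reset()
        done = False
        while not done:
            # Sample uniformly from the action space; no learning yet.
            action = env.action_space.sample()
            next_obs, reward, done, _ = env.step(action)
            record_transition(obs, action, reward, next_obs, done)
            obs = next_obs
```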
Then you can train with on-policy re-collection. By default, this runs Deep Q-Learning on the `env_procedural` environment:
```bash
sh dql_grasping/run_train_collect_eval_oss.sh
```
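
Conceptually, this script alternates three stages: off-policy training on the replay data, on-policy collection with the current policy, and evaluation rollouts. The outline below is a hedged sketch of that loop; `policy.train_on`, `buffer.extend`, `collect_fn`, and `eval_fn` are placeholder names, not the repository's entry points.

```python
def train_collect_eval(policy, buffer, collect_fn, eval_fn,
                       num_iterations, train_steps=1000):
    """Alternate training, on-policy collection, and evaluation.

    All attributes and callables used here are illustrative stand-ins
    for the corresponding stages of the pipeline.
    """
    for it in range(num_iterations):
        # Off-policy gradient updates on everything collected so far.
        policy.train_on(buffer, steps=train_steps)
        # Re-collect episodes with the current (on-policy) behavior.
        buffer.extend(collect_fn(policy))
        # Evaluate without exploration noise to track grasp success.
        success_rate = eval_fn(policy)
        print(f"iteration {it}: grasp success rate = {success_rate:.2%}")
```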