Name		Name	Last commit message	Last commit date
Latest commit History 250 Commits
.github/workflows		.github/workflows
datasets/debug_dataset/bridge_dataset/1.0.0		datasets/debug_dataset/bridge_dataset/1.0.0
docs/assets		docs/assets
experiments		experiments
orca		orca
scripts		scripts
.flake8		.flake8
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
README.md		README.md
config.py		config.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
setup.py		setup.py
train.py		train.py

Repository files navigation

ORCA

This repo contains code for training and finetuning large robot policies. Currently, ORCA policies are causal transformer models trained on a diverse mix of robot datasets using BC.

We tokenize task definitions (like language instructions or goals), observations (like RGB-D images and proprioception) and actions. Given the sequence of input tokens, the model is trained to predict the action tokens.

Installation

conda create -n orca python=3.10
conda activate orca
pip install -e .
pip install -r requirements.txt

For GPU:

pip install --upgrade "jax[cuda11_pip]==0.4.13" -f https://storage.googleapis.com/jax-releases/jax_cuda_releases.html

For TPU

pip install --upgrade "jax[tpu]==0.4.13" -f https://storage.googleapis.com/jax-releases/libtpu_releases.html

See the Jax Github page for more details on installing Jax.

Test the installation by training on the debug dataset:

python train.py --config config.py:ci_debug_dataset  --name debug

Training

Data

We use the RLDS data format and provide fast, parallelized data loaders for policy training. To download the datasets please reach out to [email protected] or download datasets directly from the **"Open X-Embodiment" repo.

Base Policy Training

To train foundational ORCA policies, you can follow the example command below. You can modify hyperparameters like dataset, batch size etc. in config.py.

python train.py --config config.py:transformer_bc_bridge --name=orca_bridge --config.dataset_kwargs.data_kwargs_list[0].data_dir=<...> --config.save_dir=<...>

Code Structure

	File	Description
Hyperparameters	config.py	Defines all hyperparameters for the training run.
Training Loop	train.py	Main training script.
Datasets	dataset.py	Functions for creating single / interleaved datasets + data augmentation.
Encoders	tokenizers.py	Tokenizers that encode image / text inputs into tokens.
Model + Objective	orca_policy.py	Sort tokens into sequence, run forward pass, compute loss.
Visualization	visualization_lib.py	Utilities for offline qualitative & quantitative eval.

Contributing

Experimental things and training/eval scripts should go in experiments/<your_name>. To make any changes to files outside of your experiments directory, please open a pull request.

Steps to contribute:

Fork the repo and create your branch from master.
Use pre-commit to enable code checks and auto-formatting.
Test that a basic training starts with the debug dataset with: python experiments/main/train.py --config experiments/main/configs/train_config.py:ci_debug_dataset --name debug

FAQ

Jax complains about wrong CUDA / CuDNN version: Jax picks up on the system CuDNN first (before using the bundled CUDA), so if you encounter version issues please update your system CUDA / CuDNN or remove it so Jax uses the bundled packages

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ORCA

Installation

Training

Data

Base Policy Training

Code Structure

Contributing

FAQ

About

Releases

Packages

Languages

License

omeryagmurlu/octo

Folders and files

Latest commit

History

Repository files navigation

ORCA

Installation

Training

Data

Base Policy Training

Code Structure

Contributing

FAQ

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages