MaxMI

This is the official repository for the paper MaxMI: A Maximal Mutual Information Criterion for Manipulation Concept Discovery.

Installation

  1. Clone the repository:

    git clone https://github.com/PeiZhou26/MaxMI.git
    cd MaxMI
  2. Create the Conda environment using the environment.yml file:

    conda env create -f environment.yml
  3. Activate the environment:

    conda activate maxmi

Tasks

The current code supports four tasks from the ManiSkill2 (v0.4.2) benchmark: PickCube-v0, StackCube-v0, PegInsertionSide-v0, and TurnFaucet-v0.
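
As a quick sanity check that the benchmark is installed correctly, each task can be instantiated through the standard ManiSkill2 gym registration. The snippet below is a minimal sketch: the obs_mode and control_mode values are illustrative choices, not settings prescribed by this repository, and the reset/step return signatures may differ slightly depending on your installed gym version.

    # Minimal ManiSkill2 (v0.4.x) smoke test -- illustrative settings only.
    import gym
    import mani_skill2.envs  # noqa: F401  (registers the ManiSkill2 tasks with gym)

    env = gym.make("PickCube-v0", obs_mode="state", control_mode="pd_joint_delta_pos")
    obs = env.reset()
    obs, reward, done, info = env.step(env.action_space.sample())
    env.close()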

Data Preparation

The behavior cloning datasets can be accessed via this link. Each task includes approximately 1,000 successful demonstrations; however, we use a randomly sampled subset of 500 for our experiments.
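
For reference, such a subset can be drawn reproducibly with a fixed seed. The sketch below is not the exact sampling code used in our experiments; traj_ids stands in for a hypothetical list of trajectory identifiers from the dataset.

    # Reproducibly sample 500 of the ~1,000 demonstrations for one task.
    # `traj_ids` is a hypothetical list of trajectory identifiers.
    import random

    def sample_demos(traj_ids, n=500, seed=0):
        rng = random.Random(seed)  # fixed seed for reproducibility
        return rng.sample(traj_ids, min(n, len(traj_ids)))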

After downloading the datasets, place them in the /data directory. To evaluate the intermediate task success rate, the ManiSkill2 environment requires patching (see /maniskill2_patches for details).

For further information, please refer to the CoTPC repository and official ManiSkill2 documentation.

Training & Evaluation

For key state discovery, which involves a differentiable mutual information estimator, we use the off-the-shelf InfoNet with its parameters kept frozen. Download the pretrained InfoNet model, place the checkpoint in a directory of your choice, and update the checkpoint path in /src/infer_infonet.py accordingly.
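
Freezing amounts to loading the pretrained weights and disabling gradients. The snippet below is a generic PyTorch pattern for this, not the exact code in /src/infer_infonet.py; the model construction and checkpoint layout are assumptions.

    # Generic PyTorch pattern for using a pretrained network with frozen weights.
    import torch
    from torch import nn

    def load_frozen(model: nn.Module, ckpt_path: str, device: str = "cpu") -> nn.Module:
        # The checkpoint is assumed to hold a plain state_dict (illustrative).
        model.load_state_dict(torch.load(ckpt_path, map_location=device))
        model.to(device).eval()
        for p in model.parameters():
            p.requires_grad_(False)  # keep the estimator frozen during training
        return model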

The script /src/concept_train.py provides an example of key state discovery and saves the trained key state localization network. After training, use /src/concept_eval.py to label key states from the demonstrations and store the key state labels in a .pkl file.

    python src/concept_train.py
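
After /src/concept_eval.py has written the labels, the .pkl file can be inspected with the standard pickle module. The filename and the structure sketched in the comment are hypothetical examples, not a guaranteed on-disk format.

    # Inspect the key state labels saved by /src/concept_eval.py.
    # The structure is an assumption for illustration,
    # e.g. {traj_id: [key_state_time_indices, ...]}.
    import pickle

    with open("key_state_labels.pkl", "rb") as f:  # hypothetical filename
        labels = pickle.load(f)
    print(type(labels), len(labels))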

After obtaining the automatically labeled key states, we use them to train a manipulation policy for each task. Our policy builds on Chain-of-Thought Predictive Control (CoTPC), which jointly optimizes key state prediction and next-action prediction (see the schematic loss sketch after the command below). To train the policy, use /src/train.py; to evaluate the trained policy, use /src/eval.py. For detailed examples of training and testing, refer to /scripts/train.sh and /scripts/eval.sh.

    bash scripts/train.sh
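
Conceptually, the CoTPC-style objective is a weighted sum of the two prediction terms. The sketch below is schematic; the loss functions and the weight lam are illustrative assumptions, not the repository's exact implementation.

    # Schematic CoTPC-style joint objective: next-action prediction plus an
    # auxiliary key state prediction term, weighted by an illustrative `lam`.
    import torch.nn.functional as F

    def joint_loss(pred_actions, gt_actions, pred_key_states, gt_key_states, lam=0.1):
        action_loss = F.mse_loss(pred_actions, gt_actions)           # behavior cloning
        key_state_loss = F.mse_loss(pred_key_states, gt_key_states)  # key state head
        return action_loss + lam * key_state_loss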

Acknowledgement

We thank the authors of CoTPC and InfoNet for releasing the code bases on which this project builds.
