Embodied Executable Policy Learning with Language-based Scene Summarization.
In NAACL 2024
If you feel our code or models help your research, kindly cite our paper:
@article{Qiu2023EmbodiedEP,
title={Embodied Executable Policy Learning with Language-based Scene Summarization},
author={Jielin Qiu* and Mengdi Xu* and William Jongwon Han* and Seungwhan Moon and Ding Zhao},
booktitle={NAACL},
year={2024}
}
The curated dataset can be found here
Preprocessing scripts are in preprocess_utils.py, preprocess_prompt.py, and preprocess_video.py
Create a virtual environment and activate it.
python -m venv .env
source .env/bin/activate
Install basic requirements.
pip install -r requirements.txt
All customizable configurations are in schema.py
To finetune or evaluate the SUM or APM model, please see main.py and add your desired arguments. You can also choose your desired learning paradigm (supervised/REINFORCE) in main.py.
This project is licensed under the CC BY-NC-SA License.