[RSS 2024] Code for "Multimodal Diffusion Transformer: Learning Versatile Behavior from Multimodal Goals" for CALVIN experiments with pre-trained weights

C++ 84 9 Updated Oct 16, 2024

bytedance / GR-1

Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"

Python 210 10 Updated Apr 22, 2024

atong01 / conditional-flow-matching

TorchCFM: a Conditional Flow Matching library

Python 1,390 117 Updated Jan 3, 2025

lucidrains / transfusion-pytorch

PyTorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 875 35 Updated Jan 6, 2025

TRI-ML / prismatic-vlms

A flexible and efficient codebase for training visually-conditioned language models (VLMs)

Python 530 283 Updated Jul 4, 2024

BaiShuanghao / Awesome-Robotics-Manipulation

A comprehensive list of resources on robot manipulation, including papers, code, and related websites.

224 13 Updated Dec 23, 2024

Alpha-VLLM / LLaMA2-Accessory

An Open-source Toolkit for LLM Development

Python 2,745 176 Updated May 24, 2024

DeepLink-org / dlinfer

Python 30 10 Updated Dec 30, 2024

AutonoBot-Lab / BestMan_Pybullet

Codebase for the BestMan Mobile Manipulator Platform

Python 191 11 Updated Dec 17, 2024

thu-ml / RoboticsDiffusionTransformer

RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation

Python 759 69 Updated Dec 24, 2024

Unity-Technologies / Unity-Robotics-Hub

Central repository for tools, tutorials, resources, and documentation for robotics simulation in Unity.

C# 2,089 433 Updated Nov 26, 2024

JadeCong / Awesome-Robot-Learning

Awesome Lists about Robot Learning.

62 2 Updated Nov 28, 2024

RifleZhang / LLaVA-Hound-DPO

Python 133 20 Updated Oct 31, 2024

THUDM / GLM-4-Voice

GLM-4-Voice | End-to-end Chinese-English spoken dialogue model

Python 2,534 202 Updated Dec 5, 2024

CyberOrigin2077 / Cyber

This repo is designed for General Robotic Operation System

Jupyter Notebook 131 21 Updated Nov 18, 2024

HaoxuHuang / copa

Official implementation of CoPa: General Robotic Manipulation through Spatial Constraints of Parts with Foundation Models

Python 43 4 Updated Oct 18, 2024

Ericcsr / ARCap

Data collection part for ARCap

Jupyter Notebook 54 5 Updated Dec 21, 2024

gpt-omni / mini-omni2

Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities.

Python 1,736 206 Updated Nov 6, 2024

google-research / planet

Learning Latent Dynamics for Planning from Pixels

Python 1,182 202 Updated Mar 24, 2023

openreasoner / openr

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,396 113 Updated Dec 26, 2024

GAIR-NLP / O1-Journey

O1 Replication Journey: A Strategic Progress Report – Part I

1,795 56 Updated Nov 30, 2024

leggedrobotics / rsl_rl

Fast and simple implementation of RL algorithms, designed to run fully on GPU.

Python 787 207 Updated Dec 20, 2024

ChopinChen chopinchenx

Lists (8)

chat-bot

database

doc-parse

Embodied Agent

Embodied Planning

LLM Planning

Open Vocabulary Object Detection

rec-sys

Stars