Stars
Official implementation for paper "Adaptive Compliance Policy Learning Approximate Compliance for Diffusion Guided Control".
Official repo and evaluation implementation of VSI-Bench
The suite of modeling video with Mamba
Simulation studies for research "Tac-Man: Tactile-Informed Prior-Free Manipulation of Articulated Objects".
Tactile Sensing and Simulation; Visual Tactile Manipulation; Open Source.
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
A generative world for general-purpose robotics & embodied AI learning.
Awesome papers & datasets specifically focused on long-term videos.
Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI
[ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model
Python sample codes for robotics algorithms.
VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
Awesome resources on normalizing flows.
A State-Space Model with Rational Transfer Function Representation.
Official Implementation of Rectified Flow (ICLR2023 Spotlight)
A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
[RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations
[arXiv 2024] Generalizable Humanoid Manipulation with Improved 3D Diffusion Policies. Part 1: Train & Deploy of iDP3
Pytorch implementation of the models RT-1-X and RT-2-X from the paper: "Open X-Embodiment: Robotic Learning Datasets and RT-X Models"
DelinQu / SimplerEnv-OpenVLA
Forked from simpler-env/SimplerEnvEvaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo, and OpenVLA) in simulation under common setups (e.g., Google Robot, WidowX+Bridge)