Lists (14)
Sort Name ascending (A-Z)
Stars
Real time 3D hand pose estimation using MediaPipe
NVIDIA Isaac GR00T N1 is the world's first open foundation model for generalized humanoid robot reasoning and skills.
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
[ICLR 2025] LAPA: Latent Action Pretraining from Videos
One-Shot Affordance Grounding of Deformable Objects in Egocentric Organizing Scenes
MANO hand model in PyTorch (anatomy consistent, anchors, etc)
[ICLR'25] 🍀 DexTrack: Towards Generalizable Neural Tracking Control for Dexterous Manipulation from Human References
One-Shot Open Affordance Learning with Foundation Models (CVPR 2024)
[ECCV 2024 Best Paper Candidate] PointLLM: Empowering Large Language Models to Understand Point Clouds
Code for my paper "You Only Teach Once: Learn One-Shot Bimanual Robotic Manipulation from Video Demonstrations"
[CVPR 2022] Joint hand motion and interaction hotspots prediction from egocentric videos
Awesome-LLM: a curated list of Large Language Model
Code for "MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training", Arxiv 2025.
Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems
[NeurIPS 2024] Official implementation of "Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models"
Code release of our paper "DI-PCG: Diffusion-based Efficient Inverse Procedural Content Generation for High-quality 3D Asset Creation".
A generative world for general-purpose robotics & embodied AI learning.
Official Code for "SingularTrajectory: Universal Trajectory Predictor using Diffusion Model (CVPR 2024)"
A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.
[RSS 2024] "DexCap: Scalable and Portable Mocap Data Collection System for Dexterous Manipulation" code repository
[CVPR 2025]Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation