
- Hong Kong
-
03:49
(UTC +08:00)
Highlights
- Pro
Starred repositories
[CVPR 2025] "A Distractor-Aware Memory for Visual Object Tracking with SAM2"
Official implementation of TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models
[CVPR 2025] Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
Wan: Open and Advanced Large-Scale Video Generative Models
[CVPR 2025] MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors
[CVPR'25] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision
[CVPR 2025] Video Depth Anything: Consistent Depth Estimation for Super-Long Videos
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
[NeurIPS 2024] Neural Localizer Fields for Continuous 3D Human Pose and Shape Estimation
The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."
Official code for "SMPLest-X: Ultimate Scaling for Expressive Human Pose and Shape Estimation"
EasyHOI: Unleashing the Power of Large Models for Reconstructing Hand-Object Interactions in the Wild
FleVRS: Towards Flexible Visual Relationship Segmentation, NeurIPS 2024
Official code for "There is no SAMantics! Exploring SAM as a Backbone for Visual Understanding Tasks"
Official repository for "SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE"
Official Implementation of the ICCV 2023 paper: Perpetual Humanoid Control for Real-time Simulated Avatars
Multi-Joint dynamics with Contact. A general purpose physics simulator.
Isaac Gym Reinforcement Learning Environments
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch
Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?
[SIGGRAPH Asia 2024] PuzzleAvatar: Assembling 3D Avatars from Personal Albums
The official PyTorch code for RoHM: Robust Human Motion Reconstruction via Diffusion.
Pytorch Implementation of "SMITE: Segment Me In TimE" (ICLR 2025)
The Fast Way From Vertices to Parametric 3D Humans