Lists (1)
Sort Oldest
Stars
[arXiv'25] Official Implementation of "Pix2Cap-COCO: Advancing Visual Comprehension via Pixel-Level Captioning"
Potential of 2D Priors for Improving Robustness of Ill-Posed 3D Reconstruction
An easy to use, user-friendly and efficient code for extracting OpenAI CLIP (Global/Grid) features from image and text respectively.
Code for paper https://arxiv.org/abs/2409.02958
Official Implementation of the ICLR 2024 spotlight paper: Universal Humanoid Motion Representations for Physics-Based Control
Official Implementation of the ICCV 2023 paper: Perpetual Humanoid Control for Real-time Simulated Avatars
BEDLAM (CVPR 2023) render pipeline tools
[NeuRIPS, 2024] Multi-Human Dataset for Close Interactions.
Code for the paper "Learning from Massive Human Videos for Universal Humanoid Pose Control"
[ECCV 2024] Official Implementation of "CoMusion: Towards Consistent Stochastic Human Motion Prediction via Motion Diffusion".
Official Code for "SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation"
Code for evaluating uncertainty estimation methods for Transformer-based architectures in natural language understanding tasks.
4DHumans: Reconstructing and Tracking Humans with Transformers
Code for "GVHMR: World-Grounded Human Motion Recovery via Gravity-View Coordinates", Siggraph Asia 2024
[CVPR 2024] TokenHMR: Advancing Human Mesh Recovery with a Tokenized Pose Representation
General API for Deep Bayesian Variational Inference by Backpropagation. The repository has been designed to work with Transformers like architectures. Compatible with the HuggingFace Transformers m…
Code for "Hierarchical World Models as Visual Whole-Body Humanoid Controllers"
Code to access and generate ProciGen dataset, CVPR'24.
Official implementation for Hierarachical Diffusion Model in CVPR24 Template free reconstruction of human object interaction
Digital Human Resource Collection: 2D/3D/4D human modeling, avatar generation & animation, clothed people digitalization, virtual try-on, and others.
Implementation of the "Point 4D Transformer Networks for Spatio-Temporal Modeling in Point Cloud Videos" paper.
deep learning for image processing including classification and object-detection etc.
Source code for the GCPR 2022 paper: <InterCap: Joint Markerless 3D Tracking of Humans and Objects in Interaction>