Stars
Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"
[arXiv 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control
A set of ComfyUI nodes providing additional control for the LTX Video model
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…
LiftImage3D: Lifting Any Single Image to 3D Gaussians with Video Generation Priors
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion
[ICLR'25] 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation
[ICLR'25] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints
[CVPR 2025] Align3R: Aligned Monocular Depth Estimation for Dynamic Videos
[CVPR'25] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision
This is an automatic full segmentation tool based on Segment-Anything-2 and Segment-Anything-1. Our tool performs automatic full segmentation of the video, enabling the tracking of each object and …
Depth Any Video with Scalable Synthetic Data (ICLR 2025)
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
Inverse Painting: Reconstructing The Painting Process (SIGGRAPH ASIA 2024)
The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."
[WACV'25 Oral] Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
Official inference repo for FLUX.1 models
Understand Human Behavior to Align True Needs
Learning with 3D rotations, a hitchhiker’s guide to SO(3) - ICML 2024
[NeurIPS 2024 Spotlight] Implementation of the paper "3D Gaussian Splatting as Markov Chain Monte Carlo"
Karabiner-Elements complex ruleset to make using macOS friendlier by enabling common keyboard functionality used in Linux and Windows.