Skip to content
View ShihMengLi's full-sized avatar

Highlights

  • Pro

Block or report ShihMengLi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"

Python 635 21 Updated Feb 20, 2025

[arXiv 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control

Python 516 18 Updated Mar 10, 2025

A set of ComfyUI nodes providing additional control for the LTX Video model

Python 471 20 Updated Mar 5, 2025

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Jupyter Notebook 7,682 494 Updated Mar 7, 2025

LiftImage3D: Lifting Any Single Image to 3D Gaussians with Video Generation Priors

Python 153 6 Updated Jan 3, 2025

[CVPR 2025] Prompt Depth Anything

Python 631 36 Updated Mar 4, 2025

DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion

Python 1,211 69 Updated Dec 7, 2024

[ICLR'25] 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation

Jupyter Notebook 309 16 Updated Feb 25, 2025

[ICLR'25] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints

Python 496 14 Updated Dec 11, 2024

[CVPR 2025] Align3R: Aligned Monocular Depth Estimation for Dynamic Videos

Python 361 22 Updated Mar 7, 2025

[CVPR'25] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision

Python 786 45 Updated Dec 8, 2024

This is an automatic full segmentation tool based on Segment-Anything-2 and Segment-Anything-1. Our tool performs automatic full segmentation of the video, enabling the tracking of each object and …

Python 150 6 Updated Oct 12, 2024

Depth Any Video with Scalable Synthetic Data (ICLR 2025)

Python 453 28 Updated Dec 4, 2024

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.

Python 4,224 302 Updated Oct 5, 2024

Inverse Painting: Reconstructing The Painting Process (SIGGRAPH ASIA 2024)

Python 173 6 Updated Dec 13, 2024

The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."

Python 1,654 127 Updated Mar 13, 2025

[WACV'25 Oral] Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think

Python 396 10 Updated Dec 16, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 37,409 4,298 Updated Mar 14, 2025

DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos

Python 1,208 66 Updated Feb 27, 2025
Jupyter Notebook 33 Updated Jan 30, 2025

Official inference repo for FLUX.1 models

Python 20,823 1,465 Updated Feb 6, 2025

GLOMAP - Global Structured-from-Motion Revisited

C++ 1,671 119 Updated Mar 13, 2025
Python 935 68 Updated Feb 24, 2025

Grounding Image Matching in 3D with MASt3R

Python 1,877 150 Updated Jan 2, 2025

Understand Human Behavior to Align True Needs

Python 3,789 344 Updated Jul 20, 2024

Learning with 3D rotations, a hitchhiker’s guide to SO(3) - ICML 2024

Python 236 10 Updated Dec 22, 2024

[NeurIPS 2024 Spotlight] Implementation of the paper "3D Gaussian Splatting as Markov Chain Monte Carlo"

Python 492 21 Updated Jan 2, 2025

Karabiner-Elements complex ruleset to make using macOS friendlier by enabling common keyboard functionality used in Linux and Windows.

Jsonnet 376 73 Updated Feb 11, 2025
Next