Open source impl of **MV-DUSt3R+ Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds** from Meta Reality Labs. Project page https://mv-dust3rp.github.io/

Python 322 9 Updated Feb 3, 2025

Qi-Zhangyang / GPT4Scene

GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models

Python 67 Updated Jan 22, 2025

DepthAnything / Video-Depth-Anything

Video Depth Anything: Consistent Depth Estimation for Super-Long Videos

Python 485 29 Updated Feb 2, 2025

vye16 / shape-of-motion

Python 872 64 Updated Aug 13, 2024

NVlabs / FoundationStereo

JavaScript 342 8 Updated Jan 21, 2025

Parskatt / RoMa

[CVPR 2024] RoMa: Robust Dense Feature Matching; RoMa is the robust dense feature matcher capable of estimating pixel-dense warps and reliable certainties for almost any image pair.

Python 711 60 Updated Nov 20, 2024

xuelunshen / gim

GIM: Learning Generalizable Image Matcher From Internet Videos (ICLR 2024 Spotlight)

Python 598 30 Updated Dec 29, 2024

yaqding / pose_monodepth

C++ 28 1 Updated Feb 1, 2025

xuxw98 / ESAM

[ICLR 2025] EmbodiedSAM: Online Segment Any 3D Thing in Real Time

Python 237 13 Updated Jan 23, 2025

yunjinli / SADG-SegmentAnyDynamicGaussian

SADG: Segment Any Dynamic Gaussian Without Object Trackers

Python 28 1 Updated Jan 20, 2025

yangchris11 / samurai

Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"

Python 6,443 401 Updated Jan 29, 2025

facebookresearch / sam2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 13,904 1,403 Updated Dec 25, 2024

zju3dv / MatchAnything

Code for "MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training", Arxiv 2025.

711 19 Updated Jan 14, 2025

MarkYu98 / madpose

Official implementation of the solvers and estimators proposed in the paper "Relative Pose Estimation through Affine Corrections of Monocular Depth Priors"

C++ 94 4 Updated Feb 5, 2025

Junyi42 / monst3r

Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"

Python 961 55 Updated Jan 20, 2025

ZcsrenlongZ / Deblur4DGS

[arXiv 2024] Deblur4DGS: 4D Gaussian Splatting from Blurry Monocular Video

Python 29 1 Updated Jan 13, 2025

genforce / JOSH

98 2 Updated Jan 7, 2025

IGL-HKUST / DiffusionAsShader

[arXiv 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control

411 12 Updated Jan 12, 2025

NVIDIA / Cosmos

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 7,372 458 Updated Jan 28, 2025

OpenDriveLab / AgiBot-World

World's First Large-scale High-quality Robotic Manipulation Benchmark

Python 1,284 85 Updated Jan 20, 2025

PKU-VCL-3DV / SLAM3R

Real-time dense scene reconstruction with SLAM3R

Python 401 12 Updated Jan 6, 2025

microsoft / MoGe

MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision

Python 716 41 Updated Dec 8, 2024

cvg / depthsplat

DepthSplat: Connecting Gaussian Splatting and Depth

Python 672 31 Updated Nov 15, 2024

fangzhou2000 / DrivingForward

[AAAI 2025] Offical implementation of "DrivingForward: Feed-forward 3D Gaussian Splatting for Driving Scene Reconstruction from Flexible Surround-view Input"

Python 66 3 Updated Dec 21, 2024

DepthAnything / PromptDA

Prompt Depth Anything

Python 517 25 Updated Jan 16, 2025

ant-research / DepthLab

Official implementation of "DepthLab: From Partial to Complete"

Python 426 22 Updated Jan 16, 2025

IGL-HKUST / das

Diffusion as Shader: 3D-aware Controllable Video Generation

JavaScript 2 Updated Jan 8, 2025