Stars
这是一个segformer-pytorch的源码,可以用于训练自己的模型。
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
A generative world for general-purpose robotics & embodied AI learning.
UniMatch V2: Pushing the Limit of Semi-Supervised Semantic Segmentation
Lifting ControlNet for Generalized Depth Conditioning
You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale
Amodal Depth Anything: Amodal Depth Estimation in the Wild
Official code of "Imagine360: Immersive 360 Video Generation from Perspective Anchor"
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion
Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
Official implementation of OneDiffusion paper
MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision
SAM4SS: Tailoring SAM and SAM2 for Semantic Segmentation
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Official implementation of Add-SD: Rational Generation without Manual Reference.
Official repository for paper "Open Panoramic Segmentation" (OPS) at ECCV 2024
Infinite Photorealistic Worlds using Procedural Generation
Official PyTorch implementation for a conditional diffusion probability model in BEV perception
[ECCV2024] Pixel-Aware Stable Diffusion for Realistic Image Super-Resolution and Personalized Stylization
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
Task-Customized Mixture of Adapters for General Image Fusion (CVPR 2024)
[CVPR' 24] Toolkit for 360Loc: A Dataset and Benchmark for Omnidirectional Visual Localization with Cross-device Queries Resources
Oracle Bone Script data collected by VLRLab of HUST
Muggled DPT: Depth estimation without the magic
[CVPR 2024] Code for "Improving the Generalization of Segmentation Foundation Model under Distribution Shift via Weakly Supervised Adaptation"
[IPCAI'2024 (IJCARS special issue)] Surgical-DINO: Adapter Learning of Foundation Models for Depth Estimation in Endoscopic Surgery
Low rank adaptation for Vision Transformer