Lists (10)
Sort Name ascending (A-Z)
🚀3D/4D
Related to 3D, 4D, Nerf, 3D GS, etc.🚀 Collections
A stack of collections, surveys, etc.🚀Diffusion Models
All diffusion models related🚀FMs & VL
Anything Models🚀Image Classification
🚀Implementation
Implementation of papers and methods🚀 Learn
Learning material🚀Object Detection
Starred repositories
we propose to generate a series of geometric shapes with target colors to disentangle (or peel off ) the target colors from the shapes. By jointly learning on multiple color-shape images, we found …
A novel approach to hunyuan image-to-video sampling
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
Official PyTorch implementation of ECCV 2024 Paper: ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback.
Motion-Controllable Video Diffusion via Warped Noise
[arXiv 2025] Official pytorch implementation of "FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors"
[CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
A light-weighted extension that brings you the best image browsing experience in VS Code, especially for remote / cloud development.
[arXiv 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control
Official repo for arxiv paper: Aligning Visual and Semantic Interpretability through Visually Grounded Concept Bottleneck Models
[NeurIPS 2024] Official code for "Neural Gaffer: Relighting Any Object via Diffusion"
MAGMA - a GPT-style multimodal model that can understand any combination of images and language. NOTE: The freely available model from this repo is only a demo. For the latest multimodal and multil…
Train transformer language models with reinforcement learning.
A general fine-tuning kit geared toward diffusion models.
Official PyTorch implementation of the paper: Flow Matching in Latent Space
FMBoost: Boosting Latent Diffusion with Flow Matching (ECCV 2024 Oral)
Implementation of rectified flow and some of its followup research / improvements in Pytorch
Official Implementation of Rectified Flow (ICLR2023 Spotlight)
[CVPR2024] 360DVD: Controllable Panorama Video Generation with 360-Degree Video Diffusion Model
Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think (ICLR 2025)
(SIGGRAPH Asia 2024) This is the official PyTorch implementation of SIGGRAPH Asia 2024 paper: DrawingSpinUp: 3D Animation from Single Character Drawings
Open-source and strong foundation image recognition models.
Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"