Diffusion
Diffusion Classifier leverages pretrained diffusion models to perform zero-shot classification without additional training
Official Code for DragGAN (SIGGRAPH 2023)
Official implementation of the paper: LivePhoto: Real Image Animation with Text-guided Motion Control
Official implementation of the paper: AnyDoor: Zero-shot Object-level Image Customization
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>
Transparent Image Layer Diffusion using Latent Transparency
🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.
Open-Sora: Democratizing Efficient Video Production for All
Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
SpeeD: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training
VideoSys: An easy and efficient system for video generation
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…
Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis
Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation, ICML 2024
An open-source toolbox for fast sampling of diffusion models. Official implementations of our works published in ICML, NeurIPS, CVPR.
[NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers"
A big_vision-inspired repo that implements a generic Auto-Encoder class capable of representation learning and generative modeling.
[NeurIPS 2024] Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
[ECCV 2024] Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation
[ECCV 2024] HiDiffusion: Increases the resolution and speed of your diffusion model by only adding a single line of code!
Code for the paper DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents, ICML 2024