Stars
Clarity AI | AI Image Upscaler & Enhancer - free and open-source Magnific Alternative
[ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation
[WIP] Layer Diffusion for WebUI (via Forge)
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
Unofficial implementation of the paper "The Chosen One: Consistent Characters in Text-to-Image Diffusion Models"
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
[CVPR2024] StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On
📷 EasyPhoto | Your Smart AI Photo Generator.
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
Code Repository for CVPR 2023 Paper "PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360 degree"
[CSUR] A Survey on Video Diffusion Models
FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)
Official Pytorch Implementation of DenseDiffusion (ICCV 2023)
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
Official repository accompanying a CVPR 2022 paper EMOCA: Emotion Driven Monocular Face Capture And Animation. EMOCA takes a single image of a face as input and produces a 3D reconstruction. EMOCA …
Official implementation of "Controlling Text-to-Image Diffusion by Orthogonal Finetuning".
Generative Models by Stability AI
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Official implementation of "DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion"
An official implementation of "Diffusion Video Autoencoders: Toward Temporally Consistent Face Video Editing via Disentangled Video Encoding" (CVPR 2023) in PyTorch.
[CVPR 2022] Official PyTorch Implementation for DiffusionCLIP: Text-guided Image Manipulation Using Diffusion Models
[CVPR 2023] Collaborative Diffusion
Face recognition with deep neural networks.
A collection of resources on controllable generation with text-to-image diffusion models.