Stars
HunyuanVideo: A Systematic Framework For Large Video Generation Model
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
Open-Sora: Democratizing Efficient Video Production for All
Lumina-T2X is a unified framework for Text to Any Modality Generation
Openpose editor for ControlNet. Full hand/face support.
Robust Human Matting via Semantic Guidance, ACCV 2022.
[AAAI 2025] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"
PyTorch code for our paper "Image Super-Resolution with Text Prompt Diffusion"
[CSUR] A Survey on Video Diffusion Models
Character Animation (AnimateAnyone, Face Reenactment)
Latent Couple extension (two shot diffusion port)
Unofficial Implementation of Animate Anyone
[AAAI 2024] Follow-Your-Pose: This repo is the official implementation of "Follow-Your-Pose: Pose-Guided Text-to-Video Generation using Pose-Free Videos"
[CVPR 2024] Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution
Kandinsky 2 — multilingual text2image latent diffusion model
[CVPR 2024] CoSeR: Bridging Image and Language for Cognitive Super-Resolution
[ECCV 2024] Pixel-Aware Stable Diffusion for Realistic Image Super-Resolution and Personalized Stylization
[ECCV 2024] Code for DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior
Image to prompt with BLIP and CLIP
A summary of diffusion-based image processing, including restoration, enhancement, coding, and quality assessment
Tiled Diffusion and VAE optimization, licensed under CC BY-NC-SA 4.0
[IJCV 2024] Exploiting Diffusion Prior for Real-World Image Super-Resolution
You can easily calculate FVD, PSNR, SSIM, LPIPS for evaluating the quality of generated or predicted videos.
👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including PSNR, SSIM, LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awesome model zoo, diffusion models, for text-to-image genera…