Stars
[ICLR 2024 Spotlight] Official implementation of ScaleCrafter for higher-resolution visual generation at inference time.
Awesome Controllable Video Generation with Diffusion Models
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation
Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"
[ECCV 2024] TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes
KolvacS-W / prompt-to-prompt
Forked from google/prompt-to-promptworkable version on diffusers 0.17.1
[AAAI 2025] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"
🔥🔥🔥A curated list of papers on recent diffusion-based high-resolution image and video synthesis works.
Character Animation (AnimateAnyone, Face Reenactment)
official implement for 《LiON: Learning Point-wise Abstaining Penalty for LiDAR Outlier DetectioN Using Diverse Synthetic Data》
MARS: An Instance-aware, Modular and Realistic Simulator for Autonomous Driving
[ICCV23] DQS3D: Densely-matched Quantization-aware Semi-supervised 3D Detection
[CVPR'23] OpenScene: 3D Scene Understanding with Open Vocabularies
This repository is an official implementation of ADAPT: Action-aware Driving Caption Transformer, accepted by ICRA 2023.
[ICRA 2023] From Semi-supervised to Omni-supervised Room Layout Estimation Using Point Clouds