Lists (5)
Sort Name ascending (A-Z)
Stars
Official PyTorch Implementation of "History-Guided Video Diffusion"
Official PyTorch implementation of "Scaling Up Personalized Image Aesthetic Assessment via Task Vector Customization" (ECCV 2024)
Memory-optimized training library for diffusion models
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…
[CVPR 2025] Official code of "DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation"
A course on aligning smol models.
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
[CVPR-W 2023] Official Implementation of One-shot Unsupervised Domain Adaptation with Personalized Diffusion Models
[NeurIPS 2024] Official implementation of "Unlocking the Capabilities of Masked Generative Models for Image Synthesis via Self-Guidance"
The official repo for "SurFhead: Affine Rig Blending for Geometrically Accurate 2D Gaussian Surfel Head Avatars"
Code for "Diffusion Model Alignment Using Direct Preference Optimization"
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…
[ECCV 2024] DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control
Official PyTorch implementation of RobustNet (CVPR 2021 Oral)
Official PyTorch implementation of HANet (CVPR 2020)
[NeurIPS2023] DatasetDM:Synthesizing Data with Perception Annotations Using Diffusion Models
We learn the dynamics model of a robot using a physics-informed neural network and use it to train a model-based RL algorithm.
[CVPR2024] StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On
[ICCV 2023] Code for "Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement"
Low rank adaptation for Vision Transformer
Official adversarial mixup resynthesis repository