Skip to content
View xiaomabufei's full-sized avatar

Block or report xiaomabufei

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Python 795 39 Updated Dec 17, 2024

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

1,828 89 Updated Jan 15, 2025

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 1,559 97 Updated Jan 17, 2025
1 Updated Jan 10, 2025

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 7,011 426 Updated Jan 9, 2025

[Arxiv 2024] Edicho: Consistent Image Editing in the Wild

93 1 Updated Jan 14, 2025

FastVideo is a lightweight framework for accelerating large video diffusion models.

Python 871 51 Updated Jan 18, 2025

Official implementation for paper - LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis

Python 113 3 Updated Dec 20, 2024

A generative world for general-purpose robotics & embodied AI learning.

Python 22,964 1,894 Updated Jan 18, 2025

Memory-optimized training scripts for video models based on Diffusers

Python 752 79 Updated Jan 17, 2025

Official Implementations for Paper - AniDoc: Animation Creation Made Easier

Python 447 29 Updated Dec 31, 2024
Python 34 Updated Dec 20, 2024

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 63,945 6,840 Updated Jan 18, 2025
Python 26 Updated Dec 12, 2024

Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text

Python 27 Updated Jan 9, 2025

Official implementations for paper: Zero-shot Image Editing with Reference Imitation

Python 1,183 83 Updated Jun 15, 2024

Official implementations for paper: Anydoor: zero-shot object-level image customization

Python 4,066 368 Updated Apr 8, 2024

More relighting!

Python 7,369 435 Updated Nov 28, 2024

Subjects200K dataset

Jupyter Notebook 90 3 Updated Jan 17, 2025

A minimal and universal controller for FLUX.1.

Python 1,103 70 Updated Jan 17, 2025

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 2,127 88 Updated Aug 6, 2024

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 2,764 160 Updated Jan 16, 2025

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 4,951 476 Updated Nov 5, 2024
Python 1,783 55 Updated Jun 28, 2024

Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System

Python 2,645 250 Updated Jan 4, 2025

The best OSS video generation models

Python 2,725 281 Updated Jan 8, 2025

Official PyTorch implementation of "Framer: Interactive Frame Interpolation".

Python 405 18 Updated Jan 9, 2025

[NeurIPS 2024] Official PyTorch implementation of LoTLIP: Improving Language-Image Pre-training for Long Text Understanding

Python 39 3 Updated Jan 14, 2025

[NeurIPS'23] Emergent Correspondence from Image Diffusion

Python 643 36 Updated May 14, 2024

Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.

Python 1,822 128 Updated Feb 23, 2024
Next