
InspireMusic: A Unified Framework for Music, Song, Audio Generation.

Python 202 15 Updated Dec 12, 2024

Realtime Video and Audio Streaming with WebRTC and Gradio

Python 102 13 Updated Dec 12, 2024

Official code for "ControlAR: Controllable Image Generation with Autoregressive Models"

Python 153 4 Updated Dec 12, 2024

Official implementation of the paper: "FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models"

Python 47 1 Updated Dec 12, 2024

You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale

Python 345 8 Updated Dec 11, 2024

[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Python 155 9 Updated Dec 11, 2024

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 965 33 Updated Dec 12, 2024

SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree

Jupyter Notebook 515 15 Updated Dec 9, 2024

Official Implementation for paper: Negative Token Merging: Image-based Adversarial Feature Guidance

Jupyter Notebook 61 2 Updated Dec 6, 2024

Official implementation of the paper “MagicDriveDiT: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control”

Python 181 7 Updated Dec 10, 2024

[NeurIPS 2024] L4GM: Large 4D Gaussian Reconstruction Model

Python 115 5 Updated Dec 2, 2024

Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".

Python 3,822 182 Updated Dec 7, 2024
Python 138 5 Updated Dec 7, 2024

[arXiv'24] Align3R: Aligned Monocular Depth Estimation for Dynamic Videos

Python 207 6 Updated Dec 12, 2024

Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision"

56 1 Updated Dec 7, 2024

Boosting Generative Novel View Synthesis with Sparse and Unposed Images

Python 40 1 Updated Dec 9, 2024

A minimal and universal controller for FLUX.1.

Python 868 46 Updated Dec 10, 2024

🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".

Python 158 1 Updated Dec 12, 2024

[ECCV 2024 & NeurIPS 2024] Official implementation of the paper TAPTR & TAPTRv2 & TAPTRv3

240 12 Updated Dec 11, 2024

A course on aligning smol models.

Jupyter Notebook 3,041 849 Updated Dec 12, 2024
Python 464 11 Updated Dec 12, 2024
Python 80 4 Updated Dec 6, 2024

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 5,602 390 Updated Dec 12, 2024

High-quality and editable surfel Gaussian generation through native 3D diffusion.

Python 194 10 Updated Dec 12, 2024

Efficient Track Anything

Python 363 9 Updated Dec 12, 2024

PyTorch implementation of "PersonaCraft: Personalized Full-Body Image Synthesis for Multiple Identities from Single References Using 3D-Model-Conditioned Diffusion"

37 Updated Dec 2, 2024

Video Depth without Video Models

Python 339 10 Updated Dec 9, 2024

Official implementation of "Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters"

152 5 Updated Dec 5, 2024

Taming FLUX for Image Inversion & Editing; OpenSora for Video Inversion & Editing! (Official implementation for Taming Rectified Flow for Inversion and Editing.)

Python 306 7 Updated Dec 5, 2024