Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Python 4,407 230 Updated Jun 14, 2024

A generative world for general-purpose robotics & embodied AI learning.

Python 19,835 1,498 Updated Dec 27, 2024

FastVideo is an open-source framework for accelerating large video diffusion models.

Python 656 38 Updated Dec 27, 2024

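[2024 new edition] University of Chinese Academy of Sciences (UCAS), Chen Yunji: lab code for the Intelligent Computing Systems (AICS) course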

Python 222 22 Updated May 31, 2024

BMVC'23 | FiveA+Network: You Only Need 9K Parameters for Underwater Image Enhancement

Python 55 2 Updated Nov 5, 2023

ECCV'22 Oral | Perceiving and Modeling Density for Single Image Dehazing.

Python 57 6 Updated May 6, 2023

[AAAI 2024] Follow-Your-Pose: This repo is the official implementation of "Follow-Your-Pose : Pose-Guided Text-to-Video Generation using Pose-Free Videos"

Python 1,286 90 Updated Mar 20, 2024

Official implementation of AAAI-2024 paper "Boosting Multiple Instance Learning Models for Whole Slide Image Classification: A Model-agnostic Framework Based on Counterfactual Inference"

Python 7 1 Updated Jul 1, 2024

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Python 9,642 1,319 Updated Sep 14, 2024

Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch

Python 485 29 Updated Oct 25, 2024

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

Python 11,811 1,037 Updated Dec 27, 2024

Official implementation of ID-unaware Deepfake Detection Model

C++ 160 21 Updated Aug 15, 2023

[Siggraph Asia 2024] Follow-Your-Emoji: This repo is the official implementation of "Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation"

Python 346 27 Updated Sep 11, 2024

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 6,754 510 Updated Dec 25, 2024

EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation

Python 2,020 235 Updated Dec 26, 2024

Diffusion Models in Medical Imaging (Published in Medical Image Analysis Journal)

1,611 151 Updated Dec 25, 2024

Official implementation of AnimateDiff.

Python 10,779 878 Updated Jul 31, 2024

Let's finetune video generation models!

Python 339 11 Updated Dec 22, 2024

🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).

HTML 394 22 Updated Dec 24, 2024

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 8,106 998 Updated Dec 24, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 16,805 1,664 Updated Dec 19, 2024

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 136,826 27,393 Updated Dec 27, 2024

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

3,704 209 Updated Dec 23, 2024

[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.

Python 671 43 Updated Dec 5, 2024

Official repository for LTX-Video

Python 2,203 156 Updated Dec 20, 2024

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 26,852 5,521 Updated Dec 27, 2024

🎓 Update Talking-Face Research Papers Daily, Now Integrated with LLM Analysis.

Python 165 16 Updated Dec 27, 2024

Endora: Video Generation Models as Endoscopy Simulators (MICCAI 2024)

Python 126 6 Updated Aug 30, 2024

[ECCV 2022] CelebV-HQ: A Large-Scale Video Facial Attributes Dataset

Python 404 29 Updated Jan 4, 2023

EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Python 3,308 385 Updated Dec 10, 2024