Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, I…

Python 6,063 519 Updated Mar 7, 2025

CLAY-3D / OpenCLAY

CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets

896 11 Updated Jun 21, 2024

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 40,530 6,100 Updated Mar 7, 2025

santisoler / cc-licenses

Creative Commons Licenses for GitHub

580 306 Updated Dec 10, 2024

PyAV-Org / PyAV

Pythonic bindings for FFmpeg's libraries.

Cython 2,684 380 Updated Feb 25, 2025

JourneyDB / JourneyDB

165 5 Updated Jul 18, 2023

3DTopia / 3DTopia

Text-to-3D Generation within 5 Minutes

Python 696 50 Updated Mar 10, 2024

deepseek-ai / DeepSeek-VL

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python 3,646 543 Updated Apr 24, 2024

lllyasviel / sd-forge-layerdiffuse

[WIP] Layer Diffusion for WebUI (via Forge)

Python 3,986 343 Updated Aug 30, 2024

mit-han-lab / distrifuser

[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

Python 659 30 Updated Dec 2, 2024

lllyasviel / LayerDiffuse

Transparent Image Layer Diffusion using Latent Transparency

2,089 30 Updated Jun 16, 2024

PixArt-alpha / PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python 2,992 189 Updated Oct 31, 2024

apple / ml-mgie

Python 3,879 254 Updated Mar 15, 2024

TencentARC / MotionCtrl

Official Code for MotionCtrl [SIGGRAPH 2024]

Python 1,407 75 Updated Feb 19, 2025

YingqingHe / Awesome-LLMs-meet-Multimodal-Generation

🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).

HTML 436 25 Updated Feb 24, 2025

duyguceylan / pix2video

Code for the paper "Pix2Video: Video Editing using Image Diffusion"

Python 69 5 Updated Oct 2, 2023

lllyasviel / Fooocus

Focus on prompting and generating

Python 43,617 6,570 Updated Jan 24, 2025

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,713 2,385 Updated Aug 12, 2024

yt-dlp / yt-dlp

A feature-rich command-line audio/video downloader

Python 103,009 8,080 Updated Mar 5, 2025

Yazhou Xing yzxing87

Lists (1)

🔮 Future ideas

Stars

VideoVerses / VideoTuna

stepfun-ai / Step-Video-T2V

SamuelSchmidgall / AgentLaboratory

VideoVerses / VideoVAEPlus

baaivision / Emu3

apple / ml-depth-pro

facebookresearch / sapiens

lucidrains / transfusion-pytorch

google / RB-Modulation

xdit-project / xDiT

litwellchi / MMTrail

modelscope / ms-swift