duyuankai1992

33 followers · 993 following

Stars

JinhuiYE / SignCL

This is the official code repository for the paper 'Improving Gloss-free Sign Language Translation by Reducing Representation Density'.

Python 25 1 Updated Nov 27, 2024

tanshuai0219 / EDTalk

[ECCV 2024 Oral] EDTalk - Official PyTorch Implementation

Python 392 38 Updated Dec 31, 2024

THUDM / CogVideo

Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 10,469 982 Updated Jan 22, 2025

ZjjConan / VLM-MultiModalAdapter

The official PyTorch implementation of our CVPR 2024 paper "MMA: Multi-Modal Adapter for Vision-Language Models".

Python 50 2 Updated Jul 23, 2024

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 34,010 3,684 Updated Jan 25, 2025

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.

Python 6,904 528 Updated Dec 25, 2024

Tencent / HunyuanDiT

Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Jupyter Notebook 3,851 324 Updated Jan 13, 2025

FangyunWei / SLRT

Python 294 61 Updated Sep 24, 2024

mloet / ASL-Recognizer

Action recognition application using models trained on WLASL dataset to translate ASL to English.

Python 1 1 Updated Oct 14, 2024

dxli94 / WLASL

WACV 2020 "Word-level Deep Sign Language Recognition from Video: A New Large-scale Dataset and Methods Comparison"

Python 896 116 Updated Mar 18, 2023

ShineChen1024 / MagicClothing

Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis

Python 1,452 144 Updated Jul 29, 2024

Boese0601 / MagicDance

[ICML 2024] MagicPose (also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion

Python 736 65 Updated Jul 3, 2024

xai-org / grok-1

Grok open release

Python 49,878 8,346 Updated Aug 30, 2024

ShineChen1024 / MiaoBi

Chinese Stable Diffusion (zh SD): Chinese text-to-image generation.

Python 47 4 Updated Mar 11, 2024

SuXuping / OCR_MLLM_TOY

A multimodal large language model for OCR (OCR_MLLM).

Python 3 2 Updated Mar 13, 2024

kohya-ss / sd-scripts

Python 5,615 920 Updated Jan 27, 2025

beixiaocai / xcms

Video behavior analysis system developed in C++, version 4.

C++ 154 40 Updated Jan 28, 2025

yangjianxin1 / Firefly

Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models.

Python 6,067 547 Updated Oct 24, 2024

HumanAIGC / EMO

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

7,566 929 Updated Aug 21, 2024

NUS-HPC-AI-Lab / VideoSys

VideoSys: An easy and efficient system for video generation

Python 1,895 129 Updated Jan 1, 2025

VikParuchuri / surya

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 15,991 1,023 Updated Jan 31, 2025

fal-ai / real-time-demo-app

A demo application using fal.realtime and the lightning fast SDXL API provided by fal

JavaScript 543 145 Updated Sep 24, 2024

levihsu / OOTDiffusion

Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on

Python 6,010 857 Updated May 13, 2024

TencentARC / PhotoMaker

PhotoMaker [CVPR 2024]

Jupyter Notebook 9,758 778 Updated Oct 31, 2024

hiyouga / FastEdit

🩹Editing large language models within 10 seconds⚡

Python 1,305 95 Updated Aug 13, 2023

ali-vilab / VGen

Official repo for VGen: a holistic video generation ecosystem built on diffusion models

Python 3,040 270 Updated Jan 10, 2025

HumanAIGC / AnimateAnyone

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

14,579 981 Updated Jul 26, 2024

LC044 / WeChatMsg

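Extracts WeChat chat history and exports it to HTML, Word, or Excel documents for permanent storage; analyzes the chat history to generate an annual chat report; and uses the chat data to train a personal AI chat assistant.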

Python 36,789 3,800 Updated Jan 2, 2025

DirtyHarryLYL / LLM-in-Vision

Recent LLM-based CV and related works. Welcome to comment/contribute!

849 36 Updated Jun 5, 2024

huggingface / peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 17,108 1,709 Updated Jan 29, 2025