This is the official code repository for the paper 'Improving Gloss-free Sign Language Translation by Reducing Representation Density'.

Python 25 1 Updated Nov 27, 2024

[ECCV 2024 Oral] EDTalk - Official PyTorch Implementation

Python 392 38 Updated Dec 31, 2024

Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 10,469 982 Updated Jan 22, 2025

The official PyTorch implementation of our CVPR 2024 paper "MMA: Multi-Modal Adapter for Vision-Language Models".

Python 50 2 Updated Jul 23, 2024

A generative speech model for daily dialogue.

Python 34,010 3,684 Updated Jan 25, 2025

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.

Python 6,904 528 Updated Dec 25, 2024

Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Jupyter Notebook 3,851 324 Updated Jan 13, 2025
Python 294 61 Updated Sep 24, 2024

Action recognition application using models trained on WLASL dataset to translate ASL to English.

Python 1 1 Updated Oct 14, 2024

WACV 2020 "Word-level Deep Sign Language Recognition from Video: A New Large-scale Dataset and Methods Comparison"

Python 896 116 Updated Mar 18, 2023

Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis

Python 1,452 144 Updated Jul 29, 2024

[ICML 2024] MagicPose (also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion

Python 736 65 Updated Jul 3, 2024

Grok open release

Python 49,878 8,346 Updated Aug 30, 2024

Chinese Stable Diffusion (zh SD): Chinese text-to-image generation

Python 47 4 Updated Mar 11, 2024

A multimodal large language model for OCR (OCR_MLLM).

Python 3 2 Updated Mar 13, 2024
Python 5,615 920 Updated Jan 27, 2025

Video behavior analysis system developed in C++, version 4

C++ 154 40 Updated Jan 28, 2025

Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models

Python 6,067 547 Updated Oct 24, 2024

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

7,566 929 Updated Aug 21, 2024

VideoSys: An easy and efficient system for video generation

Python 1,895 129 Updated Jan 1, 2025

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 15,991 1,023 Updated Jan 31, 2025

A demo application using fal.realtime and the lightning fast SDXL API provided by fal

JavaScript 543 145 Updated Sep 24, 2024

Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on

Python 6,010 857 Updated May 13, 2024

PhotoMaker [CVPR 2024]

Jupyter Notebook 9,758 778 Updated Oct 31, 2024

🩹Editing large language models within 10 seconds⚡

Python 1,305 95 Updated Aug 13, 2023

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

Python 3,040 270 Updated Jan 10, 2025

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

14,579 981 Updated Jul 26, 2024

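Extract WeChat chat history and export it to HTML, Word, and Excel documents for permanent storage; analyze the chat history to generate an annual chat report; and use the chat data to train a personal AI chat assistant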

Python 36,789 3,800 Updated Jan 2, 2025

Recent LLM-based CV and related works. Welcome to comment/contribute!

849 36 Updated Jun 5, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 17,108 1,709 Updated Jan 29, 2025