Stars
This is the official code repository for the paper 'Improving Gloss-free Sign Language Translation by Reducing Representation Density'.
[ECCV 2024 Oral] EDTalk - Official PyTorch Implementation
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
The official pytorch implemention of our CVPR-2024 paper "MMA: Multi-Modal Adapter for Vision-Language Models".
A generative speech model for daily dialogue.
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Action recognition application using models trained on WLASL dataset to translate ASL to English.
WACV 2020 "Word-level Deep Sign Language Recognition from Video: A New Large-scale Dataset and Methods Comparison"
Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis
[ICML 2024] MagicPose(also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion
Chinese Stable Diffusion, zh SD,中文文生图,中文SD,中文Stable Diffusion
A multimodal large language model for ocr. OCR_MLLM
Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
VideoSys: An easy and efficient system for video generation
OCR, layout analysis, reading order, table recognition in 90+ languages
A demo application using fal.realtime and the lightning fast SDXL API provided by fal
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
🩹Editing large language models within 10 seconds⚡
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
提取微信聊天记录,将其导出成HTML、Word、Excel文档永久保存,对聊天记录进行分析生成年度聊天报告,用聊天数据训练专属于个人的AI聊天助手
Recent LLM-based CV and related works. Welcome to comment/contribute!
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.