Starred repositories
🦜🔗 Build context-aware reasoning applications
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"
SkyReels V1: The first and most advanced open-source human-centric video foundation model
Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
Fully open reproduction of DeepSeek-R1
LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
Fine-tune BERT models to classify Arabic text by different dialects.
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
An Open-Sourced LLM-empowered Foundation TTS System
Real-time face swap for PC streaming or video calls
Repository for training models for music source separation.
Offical implement of Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for talking head Video Generation
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
This repository gives the official implementation of Realistic and Efficient Face Swapping: A Unified Approach with Diffusion Models (WACV 2025)
Multilingual Voice Understanding Model
[ACM TOG, 2024] Identity-Preserving Face Swapping via Dual Surrogate Generative Models
[ICCV 2023] BlendFace: Re-designing Identity Encoders for Face-Swapping https://arxiv.org/abs/2307.10854
A Survey on Deepfake Generation and Detection
Official Implementation of 'ReliableSwap: Boosting General Face Swapping Via Reliable Supervision'