hwscut

Starred repositories

https://hf.co/hexgrad/Kokoro-82M

JavaScript 1,478 147 Updated Mar 1, 2025

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 102,562 16,615 Updated Mar 7, 2025

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

TypeScript 43,232 3,858 Updated Mar 7, 2025

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 79,447 11,595 Updated Mar 7, 2025

Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"

Python 1,984 161 Updated Feb 10, 2025

SkyReels V1: The first and most advanced open-source human-centric video foundation model

Python 1,712 152 Updated Feb 24, 2025

Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥

Python 33,787 2,400 Updated Mar 7, 2025

Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.

Python 3,637 333 Updated Mar 7, 2025

Fully open reproduction of DeepSeek-R1

Python 22,321 2,000 Updated Mar 7, 2025

LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 440 34 Updated Feb 14, 2025

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 2,355 177 Updated Feb 14, 2025

[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation

Python 3,115 361 Updated Feb 27, 2025

Kolors Team

Python 4,244 321 Updated Nov 13, 2024

Fine-tune BERT models to classify Arabic text by different dialects.

Jupyter Notebook 15 6 Updated Aug 8, 2023

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

Python 1,304 122 Updated Apr 24, 2024

An Open-Sourced LLM-empowered Foundation TTS System

Python 631 50 Updated Oct 17, 2024

Real-time face swap for PC streaming or video calls

Python 27,760 323 Updated Nov 8, 2024

Repository for training models for music source separation.

Python 647 88 Updated Feb 16, 2025

Official implementation of Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation

Python 208 14 Updated Feb 25, 2025

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 10,104 1,384 Updated Feb 24, 2025

Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation

Python 3,500 505 Updated Feb 27, 2025

This repository gives the official implementation of Realistic and Efficient Face Swapping: A Unified Approach with Diffusion Models (WACV 2025)

Python 89 10 Updated Jan 28, 2025

GLM-4-Voice | End-to-end Chinese-English spoken dialogue model

Python 2,726 222 Updated Dec 5, 2024

Multilingual Voice Understanding Model

Python 4,792 432 Updated Jan 8, 2025

[ACM TOG, 2024] Identity-Preserving Face Swapping via Dual Surrogate Generative Models

Python 43 7 Updated Jan 9, 2025

[ICCV 2023] BlendFace: Re-designing Identity Encoders for Face-Swapping https://arxiv.org/abs/2307.10854

Python 188 9 Updated Sep 28, 2023

A Survey on Deepfake Generation and Detection

418 19 Updated Feb 21, 2025

Official Implementation of 'ReliableSwap: Boosting General Face Swapping Via Reliable Supervision'

Python 205 17 Updated Sep 28, 2023