Skip to content
View hwscut's full-sized avatar

Block or report hwscut

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 1,805 123 Updated Dec 17, 2024

EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation

Python 1,848 217 Updated Dec 16, 2024

Kolors Team

Python 4,012 294 Updated Nov 13, 2024

Fine-tune BERT models to classify Arabic text by different dialects.

Jupyter Notebook 14 6 Updated Aug 8, 2023

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

Python 1,236 116 Updated Apr 24, 2024

An Open-Sourced LLM-empowered Foundation TTS System

Python 496 35 Updated Oct 17, 2024

Real-time face swap for PC streaming or video calls

Python 27,158 127 Updated Nov 8, 2024

Repository for training models for music source separation.

Python 531 77 Updated Dec 15, 2024

Offical implement of Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for talking head Video Generation

Python 187 10 Updated Nov 12, 2024

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 8,076 1,027 Updated Dec 18, 2024

Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation

Python 4,443 632 Updated Dec 13, 2024

This repository gives the official implementation of Realistic and Efficient Face Swapping: A Unified Approach with Diffusion Models (WACV 2025)

Python 60 8 Updated Oct 28, 2024

GLM-4-Voice | 端到端中英语音对话模型

Python 2,473 198 Updated Dec 5, 2024

Multilingual Voice Understanding Model

Python 3,731 331 Updated Nov 29, 2024

[ACM TOG, 2024] Identity-Preserving Face Swapping via Dual Surrogate Generative Models

Python 35 5 Updated Nov 20, 2024

[ICCV 2023] BlendFace: Re-designing Identity Encoders for Face-Swapping https://arxiv.org/abs/2307.10854

Python 175 7 Updated Sep 28, 2023

A Survey on Deepfake Generation and Detection

352 14 Updated Dec 18, 2024

Official Implementation of 'ReliableSwap: Boosting General Face Swapping Via Reliable Supervision'

Python 198 16 Updated Sep 28, 2023

A new one shot face swap approach for image and video domains

Python 1,293 270 Updated Jul 14, 2024

a node for comfyui for restore/edit/enchance faces utilizing face recognition

Python 162 14 Updated Jun 17, 2024

[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation

Python 1,505 113 Updated Dec 12, 2024

Official PyTorch implementation of Revisiting Image Pyramid Structure for High Resolution Salient Object Detection (ACCV 2022)

Python 516 73 Updated Jan 29, 2024

TTS models for Arabic (Tacotron2, FastPitch)

Jupyter Notebook 95 24 Updated Nov 5, 2024

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 60,428 6,425 Updated Dec 18, 2024

Official inference repo for FLUX.1 models

Python 18,462 1,309 Updated Nov 21, 2024

Official implementation of AnimateDiff.

Python 10,737 877 Updated Jul 31, 2024

JoyHallo: Digital human model for Mandarin

Python 393 39 Updated Nov 21, 2024

WeCube Platform

Go 367 86 Updated Dec 18, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 22,671 2,225 Updated Nov 28, 2024

[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment

Python 2,825 197 Updated Nov 27, 2024
Next