hwscut

JacksonWong hwscut

Starred repositories

modelscope / ClearerVoice-Studio

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 1,805 123 Updated Dec 17, 2024

antgroup / echomimic_v2

EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation

Python 1,848 217 Updated Dec 16, 2024

Kwai-Kolors / Kolors

Kolors Team

Python 4,012 294 Updated Nov 13, 2024

Lafifi-24 / arabic-dialect-identification

Fine-tune BERT models to classify Arabic text by different dialects.

Jupyter Notebook 14 6 Updated Aug 8, 2023

microsoft / SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

Python 1,236 116 Updated Apr 24, 2024

FireRedTeam / FireRedTTS

An Open-Sourced LLM-empowered Foundation TTS System

Python 496 35 Updated Oct 17, 2024

iperov / DeepFaceLive

Real-time face swap for PC streaming or video calls

Python 27,158 127 Updated Nov 8, 2024

ZFTurbo / Music-Source-Separation-Training

Repository for training models for music source separation.

Python 531 77 Updated Dec 15, 2024

Hanbo-Cheng / DAWN-pytorch

Offical implement of Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for talking head Video Generation

Python 187 10 Updated Nov 12, 2024

SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 8,076 1,027 Updated Dec 18, 2024

fudan-generative-vision / hallo2

Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation

Python 4,443 632 Updated Dec 13, 2024

Sanoojan / REFace

This repository gives the official implementation of Realistic and Efficient Face Swapping: A Unified Approach with Diffusion Models (WACV 2025)

Python 60 8 Updated Oct 28, 2024

THUDM / GLM-4-Voice

GLM-4-Voice | 端到端中英语音对话模型

Python 2,473 198 Updated Dec 5, 2024

FunAudioLLM / SenseVoice

Multilingual Voice Understanding Model

Python 3,731 331 Updated Nov 29, 2024

ICTMCG / CSCS

[ACM TOG, 2024] Identity-Preserving Face Swapping via Dual Surrogate Generative Models

Python 35 5 Updated Nov 20, 2024

mapooon / BlendFace

[ICCV 2023] BlendFace: Re-designing Identity Encoders for Face-Swapping https://arxiv.org/abs/2307.10854

Python 175 7 Updated Sep 28, 2023

flyingby / Awesome-Deepfake-Generation-and-Detection

A Survey on Deepfake Generation and Detection

352 14 Updated Dec 18, 2024

ygtxr1997 / ReliableSwap

Official Implementation of 'ReliableSwap: Boosting General Face Swapping Via Reliable Supervision'

Python 198 16 Updated Sep 28, 2023

ai-forever / ghost

A new one shot face swap approach for image and video domains

Python 1,293 270 Updated Jul 14, 2024

nicofdga / DZ-FaceDetailer

a node for comfyui for restore/edit/enchance faces utilizing face recognition

Python 162 14 Updated Jun 17, 2024

ZhengPeng7 / BiRefNet

[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation

Python 1,505 113 Updated Dec 12, 2024

plemeri / InSPyReNet

Official PyTorch implementation of Revisiting Image Pyramid Structure for High Resolution Salient Object Detection (ACCV 2022)

Python 516 73 Updated Jan 29, 2024

nipponjo / tts-arabic-pytorch

TTS models for Arabic (Tacotron2, FastPitch)

Jupyter Notebook 95 24 Updated Nov 5, 2024

comfyanonymous / ComfyUI

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 60,428 6,425 Updated Dec 18, 2024

black-forest-labs / flux

Official inference repo for FLUX.1 models

Python 18,462 1,309 Updated Nov 21, 2024

guoyww / AnimateDiff

Official implementation of AnimateDiff.

Python 10,737 877 Updated Jul 31, 2024

jdh-algo / JoyHallo

JoyHallo: Digital human model for Mandarin

Python 393 39 Updated Nov 21, 2024

WeBankPartners / wecube-platform

WeCube Platform

Go 367 86 Updated Dec 18, 2024

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 22,671 2,225 Updated Nov 28, 2024

ToTheBeginning / PuLID

[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment

JacksonWong hwscut

Starred repositories

web3

voice-cloning

pose-estimation

vio

sensor-fusion

3d-reconstruction