This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For an HD commercial model, try Sync Labs.

Python 11,052 2,339 Updated Nov 26, 2024

yakami129 / VirtualWife

VirtualWife is a virtual digital human project that supports Bilibili live streaming as well as OpenAI and Ollama.

Python 2,119 324 Updated Oct 27, 2024

rpdrewes / whisper-websocket-server

Self-hosted, high-quality voice recognition for de-googled Android using Whisper. Like Siri or OK Google.

Python 55 7 Updated Dec 30, 2023

Sharrnah / whispering

Whispering Tiger - OpenAI's whisper (and other models) with OSC and Websocket support. Allowing live transcription / translation in VRChat and Overlays in most Streaming Applications

Python 406 31 Updated Dec 24, 2024

Fictionarry / ER-NeRF

[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis

Python 1,102 140 Updated Jul 12, 2024

jxlpzqc / TMSpeech

A slacking-off tool for Tencent Meeting

C# 586 51 Updated Nov 21, 2024

Const-me / Whisper

High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model

C++ 8,684 745 Updated Aug 3, 2024

davabase / whisper_real_time

Real time transcription with OpenAI Whisper.

Python 2,466 415 Updated Jun 1, 2024

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 13,072 1,401 Updated Dec 18, 2024

ibab / tensorflow-wavenet

A TensorFlow implementation of DeepMind's WaveNet paper

Python 5,420 1,292 Updated Jul 12, 2023

r9y9 / wavenet_vocoder

WaveNet vocoder

Python 2,334 500 Updated Jul 29, 2023

TensorSpeech / TensorFlowTTS

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, French, Korean, Chinese, and German, and is easy to adapt to other languages)

Python 3,863 816 Updated Jul 5, 2024

PaddlePaddle / PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 11,325 1,866 Updated Dec 27, 2024

VOICEVOX / voicevox

The editor for VOICEVOX, a free, medium-quality text-to-speech software

TypeScript 2,567 309 Updated Dec 29, 2024

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 73,478 8,779 Updated Dec 1, 2024

shibing624 / parrots

Automatic Speech Recognition (ASR) and Text-To-Speech (TTS) engine. Chinese and English speech recognition, multi-speaker speech synthesis, multilingual support, high accuracy

Python 483 89 Updated Dec 4, 2024

jackaduma / CycleGAN-VC2

Voice Conversion by CycleGAN (voice cloning / voice conversion): CycleGAN-VC2

Python 543 108 Updated Jun 10, 2023

xdcesc / my_ch_speech_recognition

Speech recognition using Python

Python 141 541 Updated Feb 16, 2022

nobody132 / masr

Chinese speech recognition; Mandarin Automatic Speech Recognition

Python 1,894 482 Updated Jul 25, 2024

babysor / MockingBird

🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real time

Python 35,563 5,219 Updated Nov 15, 2024

mengguanzhou

Stars

FunAudioLLM / SenseVoice

lipku / LiveTalking

EricGuo5513 / momask-codes

yoututu2023 / computer_vision_projects

huailiang / LipSync

THUDM / ChatGLM-6B

wanggang1987 / fast_sadtalker

kenwaytis / faster-SadTalker-API

bincooo / whisper-api

OpenTalker / SadTalker

Rudrabha / Wav2Lip