Stars
Robust Speech Recognition via Large-Scale Weak Supervision
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
A TensorFlow implementation of DeepMind's WaveNet paper
Real time interactive streaming digital human
Multilingual Voice Understanding Model
😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
use cnn recognize captcha by tensorflow. 本项目针对字符型图片验证码,使用tensorflow实现卷积神经网络,进行验证码识别。
Real time transcription with OpenAI Whisper.
VirtualWife是一个虚拟数字人项目,支持B站直播,支持openai、ollama
中文语音识别; Mandarin Automatic Speech Recognition;
[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis
Official implementation of "MoMask: Generative Masked Modeling of 3D Human Motions (CVPR2024)"
Voice Conversion by CycleGAN (语音克隆/语音转换): CycleGAN-VC2
Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine. 中英语音识别、多角色语音合成,支持多语言,准确率高
Whispering Tiger - OpenAI's whisper (and other models) with OSC and Websocket support. Allowing live transcription / translation in VRChat and Overlays in most Streaming Applications
The API server version of the SadTalker project. Runs in Docker, 10 times faster than the original!
Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.