Multilingual Voice Understanding Model

Python 3,846 343 Updated Nov 29, 2024

Real time interactive streaming digital human

Python 4,208 612 Updated Dec 29, 2024

Official implementation of "MoMask: Generative Masked Modeling of 3D Human Motions (CVPR2024)"

Python 895 72 Updated Sep 13, 2024

LipSync for Unity3D: generates lip-shape animation from speech; supports FMOD

CMake 426 77 Updated May 28, 2020

ChatGLM-6B: An Open Bilingual Dialogue Language Model

Python 40,910 5,237 Updated Jun 27, 2024

The API server version of the SadTalker project. Runs in Docker, 10 times faster than the original!

Python 127 22 Updated Aug 2, 2023
TypeScript 1 Updated Apr 12, 2024

[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Python 12,142 2,257 Updated Jun 26, 2024

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Python 11,052 2,339 Updated Nov 26, 2024

VirtualWife is a virtual digital human project that supports Bilibili live streaming and works with OpenAI and Ollama

Python 2,119 324 Updated Oct 27, 2024

Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.

Python 55 7 Updated Dec 30, 2023

Whispering Tiger - OpenAI's whisper (and other models) with OSC and Websocket support. Allowing live transcription / translation in VRChat and Overlays in most Streaming Applications

Python 406 31 Updated Dec 24, 2024

[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis

Python 1,102 140 Updated Jul 12, 2024

A slacking-off tool for Tencent Meeting

C# 586 51 Updated Nov 21, 2024

High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model

C++ 8,684 745 Updated Aug 3, 2024

Real time transcription with OpenAI Whisper.

Python 2,466 415 Updated Jun 1, 2024

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 13,072 1,401 Updated Dec 18, 2024

A TensorFlow implementation of DeepMind's WaveNet paper

Python 5,420 1,292 Updated Jul 12, 2023

WaveNet vocoder

Python 2,334 500 Updated Jul 29, 2023

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

Python 3,863 816 Updated Jul 5, 2024

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 11,325 1,866 Updated Dec 27, 2024

Editor for VOICEVOX, a free, medium-quality text-to-speech software

TypeScript 2,567 309 Updated Dec 29, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 73,478 8,779 Updated Dec 1, 2024

Automatic Speech Recognition (ASR) and Text-To-Speech (TTS) engine. Chinese and English speech recognition, multi-speaker speech synthesis, multilingual support, high accuracy

Python 483 89 Updated Dec 4, 2024

Voice Conversion by CycleGAN (voice cloning / voice conversion): CycleGAN-VC2

Python 543 108 Updated Jun 10, 2023

Speech recognition using Python

Python 141 541 Updated Feb 16, 2022

Mandarin Automatic Speech Recognition

Python 1,894 482 Updated Jul 25, 2024

🚀 AI voice cloning: Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 35,563 5,219 Updated Nov 15, 2024