wwwei1997

Wei wwwei1997

5 followers · 12 following

xjtu
西安

Stars

Audio/TTS

28 repositories

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 38,311 4,800 Updated Aug 16, 2024

jaywalnut310 / vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 7,205 1,314 Updated Dec 6, 2023

ming024 / FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Python 1,967 558 Updated Oct 27, 2023

lifeiteng / vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Python 2,101 321 Updated Nov 14, 2023

suno-ai / bark

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 37,162 4,390 Updated Aug 19, 2024

lucidrains / naturalspeech2-pytorch

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Python 1,314 104 Updated Sep 24, 2023

Plachtaa / VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/

Python 7,820 776 Updated Feb 11, 2024

yl4579 / StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 5,530 511 Updated Aug 10, 2024

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 8,655 673 Updated Mar 3, 2025

facebookresearch / seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,378 1,120 Updated Nov 14, 2024

fishaudio / Bert-VITS2

vits2 backbone with multilingual-bert

Python 8,305 1,171 Updated Mar 10, 2025

sh-lee-prml / HierSpeechpp

The official implementation of HierSpeech++

Python 1,213 148 Updated Feb 20, 2024

facebookresearch / audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 21,624 2,265 Updated Jan 15, 2025