- F5-TTS Public
  Forked from SWivid/F5-TTS
  Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
  Python · MIT License · Updated Oct 21, 2024
- e2-tts-pytorch Public
  Forked from lucidrains/e2-tts-pytorch
  Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch
  Python · MIT License · Updated Oct 16, 2024
- conditional-flow-matching Public
  Forked from atong01/conditional-flow-matching
  TorchCFM: a Conditional Flow Matching library
  Python · MIT License · Updated Aug 21, 2024
- gateloop-transformer Public
  Forked from lucidrains/gateloop-transformer
  Implementation of GateLoop Transformer in Pytorch and Jax
  Python · MIT License · Updated Jan 31, 2024
- ASVGP Public
  Forked from HJakeCunningham/ASVGP
  Actually Sparse Variational Gaussian Processes implemented in GPflow
  Python · Updated Aug 9, 2023
- bark Public
  Forked from suno-ai/bark
  🔊 Text-Prompted Generative Audio Model
  Python · MIT License · Updated Jun 30, 2023
- naturalspeech2-pytorch Public
  Forked from lucidrains/naturalspeech2-pytorch
  Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
  Python · MIT License · Updated May 24, 2023
- tortoise-tts Public
  Forked from neonbjb/tortoise-tts
  A multi-voice TTS system trained with an emphasis on quality
  Python · Apache License 2.0 · Updated Apr 30, 2023
- vall-e Public
  Forked from enhuiz/vall-e
  An unofficial PyTorch implementation of the audio LM VALL-E
  Python · MIT License · Updated Apr 25, 2023
- phonemizer Public
  Forked from bootphon/phonemizer
  Simple text to phones converter for multiple languages
  Python · GNU General Public License v3.0 · Updated Mar 23, 2023
- pits Public
  Forked from anonymous-pits/pits
  PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor
  Python · MIT License · Updated Mar 10, 2023
- silero-vad Public
  Forked from snakers4/silero-vad
  Silero VAD: pre-trained enterprise-grade Voice Activity Detector, Language Classifier and Spoken Number Detector
  Python · MIT License · Updated Feb 9, 2023
- TTS-1 Public
  Forked from coqui-ai/TTS
  🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
  Python · Mozilla Public License 2.0 · Updated Oct 29, 2022
- DeepLearningExamples Public
  Forked from NVIDIA/DeepLearningExamples
  Deep Learning Examples
  Python · Updated Jul 24, 2022
- pytorch-tvmisc Public
  Forked from t-vi/pytorch-tvmisc
  Totally Versatile Miscellanea for Pytorch
  Jupyter Notebook · MIT License · Updated Mar 20, 2022
- Robust_Fine_Grained_Prosody_Control Public
  Forked from keonlee9420/Robust_Fine_Grained_Prosody_Control
  PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis
  Python · BSD 3-Clause "New" or "Revised" License · Updated Feb 20, 2022
- variational-autoencoder Public
  Forked from jaanli/variational-autoencoder
  Variational autoencoder implemented in tensorflow and pytorch (including inverse autoregressive flow)
  Python · MIT License · Updated Nov 11, 2021
- flowtron Public
  Forked from NVIDIA/flowtron
  Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer
  Jupyter Notebook · Apache License 2.0 · Updated Oct 25, 2021
- Real-Time-Voice-Cloning Public
  Forked from CorentinJ/Real-Time-Voice-Cloning
  Clone a voice in 5 seconds to generate arbitrary speech in real-time
  Python · Other · Updated Oct 16, 2021
- streamlit-text-to-speech Public
  Forked from android-iceland/streamlit-text-to-speech
  Python · Apache License 2.0 · Updated May 30, 2021
- stupid-simple-norm-flow Public
  Forked from mrsalehi/stupid-simple-norm-flow
  Jupyter Notebook · Updated Apr 2, 2021
- TTS Public
  Forked from mozilla/TTS
  🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
  Jupyter Notebook · Mozilla Public License 2.0 · Updated Mar 13, 2021
- Split-Audio-By-TextGrid Public
  Forked from VegetableWithChicken/Split-Audio-By-TextGrid
  Splits .mp3 or .wav audio by reading the .TextGrid file created by annotate-TextGrid of Praat
  Python · Updated Mar 2, 2021
- py-webrtcvad Public
  Forked from wiseman/py-webrtcvad
  Python interface to the WebRTC Voice Activity Detector
  C · Other · Updated Feb 15, 2021
- GST-Tacotron Public
  Forked from KinglittleQ/GST-Tacotron
  A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
  Python · MIT License · Updated Dec 9, 2020
- VAD-python Public
  Forked from marsbroshok/VAD-python
  Voice Activity Detector in Python
  Python · Updated Nov 17, 2020
- keras-ncp Public
  Forked from mlech26l/ncps
  Code repository of the paper "Neural circuit policies enabling auditable autonomy", published in Nature Machine Intelligence
  Python · Apache License 2.0 · Updated Nov 4, 2020
- espnet Public
  Forked from espnet/espnet
  End-to-End Speech Processing Toolkit
  Python · Apache License 2.0 · Updated Oct 4, 2020
- GMVAE Public
  Forked from jariasf/GMVAE
  Implementation of Gaussian Mixture Variational Autoencoder (GMVAE) for Unsupervised Clustering
  Python · MIT License · Updated Oct 2, 2020