Skip to content
View TTS-Fyc's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report TTS-Fyc

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
46 stars written in Python
Clear filter

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 38,544 4,360 Updated Jan 2, 2025

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 36,708 4,539 Updated Aug 16, 2024

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 35,631 5,221 Updated Nov 15, 2024

Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复

Python 18,661 1,944 Updated Apr 4, 2024

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 11,363 1,866 Updated Jan 6, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 9,396 905 Updated Jan 10, 2025

A PyTorch-based Speech Toolkit

Python 9,167 1,416 Updated Jan 10, 2025

Manipulate audio with a simple and easy high level interface

Python 9,089 1,060 Updated Jul 25, 2024

End-to-End Speech Processing Toolkit

Python 8,671 2,202 Updated Jan 10, 2025

vits2 backbone with multilingual-bert

Python 8,163 1,153 Updated Jan 9, 2025

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 7,020 1,282 Updated Dec 6, 2023

Production First and Production Ready End-to-End Speech Recognition Toolkit

Python 4,260 1,096 Updated Jan 10, 2025

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

Python 3,874 815 Updated Jul 5, 2024

This is a python API which allows you to get the transcript/subtitles for a given YouTube video. It also works for automatically generated subtitles and it does not require an API key nor a headles…

Python 3,262 369 Updated Jan 7, 2025

An unofficial PyTorch implementation of the audio LM VALL-E

Python 2,982 418 Updated May 10, 2023

A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)

Python 2,964 958 Updated Jul 6, 2023

A python package to analyze and compare voices with deep learning

Python 2,822 434 Updated Oct 12, 2023

DeepMind's Tacotron-2 Tensorflow implementation

Python 2,288 906 Updated Jul 6, 2023

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Python 2,079 322 Updated Nov 14, 2023

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Python 2,013 514 Updated Jul 27, 2024

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Python 1,897 547 Updated Oct 27, 2023

A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model

Python 1,827 436 Updated Jan 17, 2022
Python 1,404 181 Updated Feb 11, 2024

Command line utility for forced alignment using Kaldi

Python 1,385 250 Updated Dec 2, 2024

🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.

Python 1,137 225 Updated May 3, 2024

In defence of metric learning for speaker recognition

Python 1,077 275 Updated Mar 26, 2024

The Implementation of FastSpeech based on pytorch.

Python 862 213 Updated Jul 6, 2023

基于PaddlePaddle实现端到端中文语音识别,从入门到实战,超简单的入门案例,超实用的企业项目。支持当前最流行的DeepSpeech2、Conformer、Squeezeformer模型

Python 831 130 Updated Jan 7, 2025

Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)

Python 785 96 Updated Sep 30, 2021

Chinese text normalization for speech processing

Python 642 146 Updated Mar 18, 2023
Next