TTS-Fyc

🎯

Focusing

0 followers · 3 following

Lists (7)

Stars

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python · 9,392 stars · 905 forks · Updated Jan 10, 2025

microsoft / CLAP

Learning audio concepts from natural language supervision

Python · 515 stars · 39 forks · Updated Sep 18, 2024

AndreyGuzhov / AudioCLIP

Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)

Python · 785 stars · 96 forks · Updated Sep 30, 2021

enhuiz / vall-e

An unofficial PyTorch implementation of the audio LM VALL-E

Python · 2,982 stars · 418 forks · Updated May 10, 2023

lifeiteng / vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Python · 2,079 stars · 322 forks · Updated Nov 14, 2023

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python · 38,537 stars · 4,360 forks · Updated Jan 2, 2025

espnet / espnet

End-to-End Speech Processing Toolkit

Python · 8,671 stars · 2,202 forks · Updated Jan 10, 2025

clovaai / voxceleb_trainer

In defence of metric learning for speaker recognition

Python · 1,077 stars · 275 forks · Updated Mar 26, 2024

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python · 36,704 stars · 4,538 forks · Updated Aug 16, 2024

Edresson / YourTTS

YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

Jupyter Notebook · 933 stars · 81 forks · Updated Nov 4, 2024

resemble-ai / Resemblyzer

A python package to analyze and compare voices with deep learning

Python · 2,822 stars · 434 forks · Updated Oct 12, 2023

innnky / emotional-vits

An emotion-controllable speech synthesis model based on VITS that requires no emotion annotations

Jupyter Notebook · 1,347 stars · 167 forks · Updated Mar 30, 2023

polvanrijn / VoiceMe

Repository for the paper: VoiceMe: Personalized voice generation in TTS

Python · 126 stars · 21 forks · Updated Apr 29, 2022

BUPT-OS / RROS

RROS is a dual-kernel OS for satellites or other scenarios that need both real-time and general-purpose abilities. RROS = RTOS (Rust) + Linux (C).

C · 602 stars · 45 forks · Updated Jan 3, 2025

kaixindelele / ChatPaper

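Use ChatGPT to summarize arXiv papers. Accelerates the full research workflow, using ChatGPT for full-paper summarization, professional translation, polishing, peer review, and review responses.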

Python · 18,661 stars · 1,944 forks · Updated Apr 4, 2024

anyvoiceai / MassTTS

a TTS demo for training new characters.

Python · 444 stars · 56 forks · Updated Jan 5, 2024

fishaudio / Bert-VITS2

vits2 backbone with multilingual-bert

Python · 8,163 stars · 1,153 forks · Updated Jan 9, 2025

Executedone / Chinese-FastSpeech2

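Continues training on the Biaobei dataset and improves the original FastSpeech2 model by introducing prosody representation and prosody prediction modules, making Chinese pronunciation more vivid and rhythmic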

Python · 253 stars · 42 forks · Updated Sep 10, 2023

weekend-project-space / top-rss-list

The most-subscribed RSS feeds; high-quality Chinese-language RSS feeds

3,732 stars · 125 forks · Updated Dec 19, 2024

microsoft / NeuralSpeech

Python · 1,404 stars · 181 forks · Updated Feb 11, 2024

jiaaro / pydub

Manipulate audio with a simple and easy high level interface

Python · 9,089 stars · 1,060 forks · Updated Jul 25, 2024

qiaolinwang / VITS

Forked from AlexandaJerry/vits-mandarin-biaobei

Implementation of the VITS model

Jupyter Notebook · 405 stars · 76 forks · Updated Jul 21, 2023

Lists (7)

ASR

Dataset tools

ML-basic

SR

SSP

TP

TTS
