Skip to content
View TTS-Fyc's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report TTS-Fyc

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 9,392 905 Updated Jan 10, 2025

Learning audio concepts from natural language supervision

Python 515 39 Updated Sep 18, 2024

Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)

Python 785 96 Updated Sep 30, 2021

An unofficial PyTorch implementation of the audio LM VALL-E

Python 2,982 418 Updated May 10, 2023

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Python 2,079 322 Updated Nov 14, 2023

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 38,537 4,360 Updated Jan 2, 2025

End-to-End Speech Processing Toolkit

Python 8,671 2,202 Updated Jan 10, 2025

In defence of metric learning for speaker recognition

Python 1,077 275 Updated Mar 26, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 36,704 4,538 Updated Aug 16, 2024

YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

Jupyter Notebook 933 81 Updated Nov 4, 2024

A python package to analyze and compare voices with deep learning

Python 2,822 434 Updated Oct 12, 2023

无需情感标注的情感可控语音合成模型,基于VITS

Jupyter Notebook 1,347 167 Updated Mar 30, 2023

Repository for the paper: VoiceMe: Personalized voice generation in TTS

Python 126 21 Updated Apr 29, 2022

RROS is a dual-kernel OS for satellites or other scenarios that need both real-time and general-purpose abilities. RROS = RTOS (Rust) + Linux (C).

C 602 45 Updated Jan 3, 2025

Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复

Python 18,661 1,944 Updated Apr 4, 2024

a TTS demo for training new characters.

Python 444 56 Updated Jan 5, 2024

vits2 backbone with multilingual-bert

Python 8,163 1,153 Updated Jan 9, 2025

基于标贝数据继续训练,同时对原本的FastSpeech2模型做了改进,引入了韵律表征以及韵律预测模块,使中文发音更生动且富有节奏

Python 253 42 Updated Sep 10, 2023

订阅人数最多的rss源,中文优质rss源

3,732 125 Updated Dec 19, 2024
Python 1,404 181 Updated Feb 11, 2024

Manipulate audio with a simple and easy high level interface

Python 9,089 1,060 Updated Jul 25, 2024

Implementation of the VITS model

Jupyter Notebook 405 76 Updated Jul 21, 2023

基于PaddlePaddle实现端到端中文语音识别,从入门到实战,超简单的入门案例,超实用的企业项目。支持当前最流行的DeepSpeech2、Conformer、Squeezeformer模型

Python 831 130 Updated Jan 7, 2025

This is a python API which allows you to get the transcript/subtitles for a given YouTube video. It also works for automatically generated subtitles and it does not require an API key nor a headles…

Python 3,262 369 Updated Jan 7, 2025

Voice dataset of Genshin Impact 原神语音数据集

691 52 Updated Jul 5, 2023

A python package for calculating the PESQ.

Python 367 70 Updated Apr 24, 2023

PESQ (Perceptual Evaluation of Speech Quality) Wrapper for Python Users (narrow band and wide band)

C 550 99 Updated Sep 5, 2024

vits chinese, tts chinese, tts mandarin 史上训练最简单,音质最好的语音合成系统

Python 213 71 Updated Sep 27, 2021

AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data

Jupyter Notebook 70 20 Updated Aug 31, 2021

基于PyTorch的VITS-BigVGAN的tts中文模型,加入韵律预测模型。

Python 194 25 Updated Sep 15, 2022
Next