An unofficial PyTorch implementation of the audio LM VALL-E

Python 2,971 419 Updated May 10, 2023

X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion

Python 72 8 Updated Apr 1, 2024

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Python 677 51 Updated Nov 15, 2024

PyTorch implementation of the paper MultiSpeech: Multi-Speaker Text to Speech with Transformer

Python 20 5 Updated Jun 23, 2022

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in PyTorch

Python 1,292 101 Updated Sep 24, 2023

This is a TTS model based on VITS that can control the output speech emotion through natural language and control the speaker through reference audio.

Python 3 2 Updated Aug 15, 2024

TransferTTS (Zero-Shot learning of VITS)

Python 92 12 Updated Sep 23, 2022

PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs

Python 320 44 Updated Feb 21, 2022

YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

Jupyter Notebook 918 80 Updated Nov 4, 2024

VITS implementation for Japanese, Chinese, Korean, Sanskrit and Thai

Python 34 19 Updated Mar 19, 2024

Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform with Multilingual Cleaners

Python 64 6 Updated Nov 21, 2022

Bilingual-TTS (Japanese and Korean)

Jupyter Notebook 30 5 Updated Jul 1, 2023

PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.

Python 292 47 Updated Aug 25, 2021

Multi-speaker Speech Synthesis Using VITS (KO, JA, EN, ZH)

Python 74 7 Updated Feb 28, 2024

Implementation of Korean FastSpeech2

Python 2 Updated Nov 26, 2020

The official implementation of EmoSphere-TTS

Python 89 8 Updated Aug 5, 2024

The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis”

Python 81 12 Updated Dec 20, 2022

A fast Text-to-Speech (TTS) model. Works well for English, Mandarin/Chinese, Japanese, Korean, Russian and Tibetan (tested so far).

Python 255 46 Updated Mar 25, 2023