Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
configs		configs
filelists		filelists
misc		misc
monotonic_align		monotonic_align
text		text
README.md		README.md
attentions.py		attentions.py
commons.py		commons.py
data_utils.py		data_utils.py
losses.py		losses.py
mel_processing.py		mel_processing.py
models.py		models.py
modules.py		modules.py
requirements.txt		requirements.txt
stft.py		stft.py
train.py		train.py
transforms.py		transforms.py
utils.py		utils.py
vits_strings.py		vits_strings.py

Repository files navigation

vits实现的中文TTS

this is the copy of https://github.com/jaywalnut310/vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Espnet连接：github.com/espnet/espnet/tree/master/espnet2/gan_tts/vits

coqui-ai/TTS连接：github.com/coqui-ai/TTS/tree/main/recipes/ljspeech/vits_tts

base on:https://github.com/lutianxiong/vits_chinese

如果有侵权行为，请联系我，我将删除项目

If there is infringement, please contact me and I will delete the item

基于VITS 实现 16K baker TTS 的流程记录

pip install -r requirements.txt

cd monotonic_align

python setup.py build_ext --inplace

将16K标贝音频拷贝到./baker_waves/，启动训练

python train.py -c configs/baker_base.json -m baker_base

两张1080卡，训练两天，基本可以使用了

测试

python vits_strings.py

PQMF+iSTFT

直接训练PQMF+iSTFT模型，出现文本编码和时长出错、而VAE编码再解码正确

故采用训练策略：训练基础VITS模型进行迁移训练，只训练PQMF+iSTFT的HiFi-GAN解码器

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

基于VITS 实现 16K baker TTS 的流程记录

将16K标贝音频拷贝到./baker_waves/，启动训练

测试

PQMF+iSTFT

About

Releases

Packages

Languages

tuong-olli/vits_chinese

Folders and files

Latest commit

History

Repository files navigation

基于VITS 实现 16K baker TTS 的流程记录

将16K标贝音频拷贝到./baker_waves/，启动训练

测试

PQMF+iSTFT

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages