Emotional VITS

Online demo ↑↑↑ bilibili demo

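The dataset needs no emotion annotation. An emotion extraction model extracts an emotion embedding for each utterance, and this embedding is fed into the network to achieve emotion-controllable VITS synthesis.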

Model structure

Compared with the original VITS, only the TextEncoder is modified.
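One way to picture a TextEncoder that accepts an emotion embedding is sketched below in NumPy. This is an illustration under assumptions, not the code in models.py: the sizes, the names `token_table` and `emo_proj`, and the choice to add a projected emotion vector to every token embedding are all hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes, chosen only for illustration.
VOCAB_SIZE, HIDDEN, EMO_DIM = 40, 8, 16

token_table = rng.normal(size=(VOCAB_SIZE, HIDDEN))  # phoneme embedding table
emo_proj = rng.normal(size=(EMO_DIM, HIDDEN)) * 0.1  # linear projection for the emotion vector


def embed_with_emotion(token_ids, emo):
    """Return per-token features with one utterance-level emotion vector added.

    token_ids: int array of shape (T,)
    emo:       float array of shape (EMO_DIM,)
    """
    x = token_table[token_ids]  # (T, HIDDEN)
    e = emo @ emo_proj          # (HIDDEN,)
    return x + e[None, :]       # the same offset is broadcast to every token


tokens = np.array([3, 7, 7, 1])
calm = rng.normal(size=EMO_DIM)  # stand-in for an extracted emotion embedding
features = embed_with_emotion(tokens, calm)
print(features.shape)  # (4, 8)
```

In this sketch the emotion vector shifts all token features by the same amount, so the text content and the emotion stay separate inputs.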

Strengths and weaknesses of the model

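Drawbacks of the model:

Inference requires a reference audio clip that specifies the emotion; without one, no audio can be synthesized. The model itself does not know which emotional features correspond to words such as "excited" or "calm".
For a single-speaker model, you can pre-select clips: manually pick a few recordings that sound "excited", "calm", "quiet", and so on, and thereby build the emotion text -> emotion embedding mapping by hand (a clustering algorithm can simplify this selection).
For a multi-speaker model, this pre-selection approach is limited. Even for the same emotion, such as "calm", different speakers may have different emotion embeddings, so building the emotion text -> emotion embedding mapping becomes tedious, and it is hard to describe similar emotions across speakers with one unified standard.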
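The hand-built emotion text -> emotion embedding mapping could be stored as a small lookup table. The sketch below keys the table by (speaker, label), since the same label may need a different embedding for each speaker. The function name, the speaker names, and the 2-dimensional toy vectors are hypothetical; averaging the picked clips is one possible choice, not something the repository prescribes.

```python
import numpy as np


def build_emotion_table(picked):
    """Average hand-picked reference embeddings per (speaker, label) key.

    picked: dict mapping (speaker, label) -> list of 1-D embedding arrays
    """
    table = {}
    for key, embeddings in picked.items():
        if not embeddings:
            raise ValueError(f"no reference clips picked for {key!r}")
        table[key] = np.mean(np.stack(embeddings), axis=0)
    return table


# Toy 2-dimensional vectors stand in for real emotion embeddings.
picked = {
    ("speaker_a", "calm"): [np.array([0.0, 1.0]), np.array([0.2, 0.8])],
    ("speaker_a", "excited"): [np.array([1.0, 0.0])],
    ("speaker_b", "calm"): [np.array([-0.5, 0.5])],
}
table = build_emotion_table(picked)
calm_for_a = table[("speaker_a", "calm")]
```

A single-speaker model only needs one speaker key. With several speakers, every (speaker, label) pair has to be filled in separately, which is the tedious part described above.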

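Advantages of the model:

Any ordinary TTS dataset can be used for emotion control. No manual emotion labels are needed.
Because training does not fix any correspondence between emotion text and embeddings, all emotion embeddings lie in one continuous space.
In theory, therefore, any emotion that appears in a speaker's dataset can be synthesized at inference time: just input the embedding of a clip with the target emotion. The number of emotions is not limited by a fixed set of categories.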
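Because the embeddings live in a continuous space rather than a fixed label set, one experiment that suggests itself is mixing two reference embeddings. The helper below is a hypothetical sketch of plain linear interpolation; whether a blended embedding yields a sensible in-between emotion is not claimed by the source and would need to be checked by listening.

```python
import numpy as np


def blend(emo_a, emo_b, weight):
    """Linearly interpolate between two emotion embeddings.

    weight = 0.0 returns emo_a, weight = 1.0 returns emo_b.
    """
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must be in [0, 1]")
    return (1.0 - weight) * np.asarray(emo_a) + weight * np.asarray(emo_b)


# Toy vectors stand in for embeddings of a calm and an excited clip.
calm = np.array([0.0, 1.0, 0.0])
excited = np.array([1.0, 0.0, 0.0])
halfway = blend(calm, excited, 0.5)
```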

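Quickly picking audio clips for each emotion

A clustering algorithm can automatically group clips by their emotion embeddings. It can roughly separate categories whose emotions differ strongly. For usage, see emotion_clustering.ipynb.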
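To show the idea without depending on the notebook, here is a minimal k-means written in NumPy. It is a stand-in: the notebook may use a different algorithm or library, and the synthetic 4-dimensional "embeddings" below are made up for the demonstration.

```python
import numpy as np


def kmeans(embeddings, k, n_iter=50, seed=0):
    """Cluster row vectors with plain k-means; returns (labels, centers)."""
    rng = np.random.default_rng(seed)
    # Start from k distinct data points chosen at random.
    centers = embeddings[rng.choice(len(embeddings), size=k, replace=False)]
    for _ in range(n_iter):
        dists = np.linalg.norm(embeddings[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Move each center to the mean of its members; keep it if the cluster is empty.
        new_centers = np.array([
            embeddings[labels == j].mean(axis=0) if np.any(labels == j) else centers[j]
            for j in range(k)
        ])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    # Final assignment against the final centers.
    dists = np.linalg.norm(embeddings[:, None, :] - centers[None, :, :], axis=2)
    return dists.argmin(axis=1), centers


# Two well-separated synthetic groups stand in for "quiet" and "loud" clips.
rng = np.random.default_rng(1)
quiet = rng.normal(loc=0.0, scale=0.1, size=(20, 4))
loud = rng.normal(loc=10.0, scale=0.1, size=(20, 4))
labels, centers = kmeans(np.vstack([quiet, loud]), k=2)
```

After clustering, listening to a few clips from each cluster is still needed to decide which emotion, if any, a cluster represents.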

Pre-requisites

Python >= 3.6
Clone this repository
Install python requirements. Please refer to requirements.txt
Prepare datasets
Build Monotonic Alignment Search and run preprocessing if you use your own datasets.

# Cython-version Monotonic Alignment Search
cd monotonic_align
python setup.py build_ext --inplace

# Preprocessing (g2p) for your own datasets. Preprocessed phonemes for nene have already been provided.
python preprocess.py --text_index 2 --filelists filelists/train.txt filelists/val.txt --text_cleaners japanese_cleaners
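The `--text_index 2` flag suggests that each filelist line has several fields and the text is the third one (zero-based index 2). The sketch below assumes the pipe-separated `path|speaker_id|text` layout used by the original VITS filelists; the helper names and the sample line are hypothetical.

```python
def split_filelist_line(line, text_index=2, sep="|"):
    """Split one filelist line into its fields and return (fields, text)."""
    fields = line.rstrip("\n").split(sep)
    if text_index >= len(fields):
        raise ValueError(f"line has {len(fields)} fields, no index {text_index}")
    return fields, fields[text_index]


def replace_text(line, new_text, text_index=2, sep="|"):
    """Return the line with its text field replaced, e.g. by cleaned phonemes."""
    fields, _ = split_filelist_line(line, text_index, sep)
    fields[text_index] = new_text
    return sep.join(fields)


# Hypothetical multi-speaker line: audio path, speaker id, raw text.
sample = "wavs/0001.wav|0|こんにちは\n"
fields, text = split_filelist_line(sample)
cleaned = replace_text(sample, "koNnichiwa")
```

For a single-speaker filelist with only `path|text`, the text would sit at index 1 instead.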

Extract emotional embeddings. This generates a *.emo.npy file for each wav file.

python emotion_extract.py --filelists filelists/train.txt filelists/val.txt
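Before training, it can help to confirm that every clip in a filelist has its embedding file. The sketch below assumes that the embedding is saved next to the audio with `.emo.npy` appended to the full wav path, and that the wav path is the first pipe-separated field of each filelist line. Both are assumptions, as are the helper names; adjust them to whatever emotion_extract.py actually writes.

```python
from pathlib import Path


def emo_path_for(wav_path):
    """Assumed naming: '<wav path>.emo.npy' (suffix appended to the full path)."""
    return Path(str(wav_path) + ".emo.npy")


def missing_embeddings(filelist_path, sep="|"):
    """Return the wav paths in a filelist that have no embedding file yet."""
    missing = []
    with open(filelist_path, encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if not line:
                continue  # skip blank lines
            wav = line.split(sep)[0]
            if not emo_path_for(wav).exists():
                missing.append(wav)
    return missing
```

Calling `missing_embeddings("filelists/train.txt")` would then list the clips for which extraction still needs to run; an empty list means the filelist is fully covered.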

Training Example

# nene
python train_ms.py -c configs/nene.json -m nene

# If you are fine-tuning from a pretrained original VITS checkpoint:
python train_ms.py -c configs/nene.json -m nene --ckptD /path/to/D_xxxx.pth --ckptG /path/to/G_xxxx.pth
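Since this model changes the TextEncoder, a checkpoint from the original VITS presumably lacks weights for the added emotion-related parameters. A common way to handle that kind of mismatch is to copy every parameter that matches by name and shape and leave the rest at their fresh initialization. The sketch below shows that idea with NumPy arrays standing in for tensors; it is not necessarily what train_ms.py does, and the parameter names are hypothetical.

```python
import numpy as np


def merge_pretrained(new_state, pretrained_state):
    """Copy pretrained values where name and shape match; keep new values otherwise.

    Returns (merged_state, names_kept_from_new_model).
    """
    merged, kept_new = {}, []
    for name, value in new_state.items():
        old = pretrained_state.get(name)
        if old is not None and old.shape == value.shape:
            merged[name] = old
        else:
            merged[name] = value
            kept_new.append(name)
    return merged, kept_new


# Hypothetical parameter names; the second one exists only in the new model.
new_state = {
    "text_encoder.emb.weight": np.zeros((40, 8)),
    "text_encoder.emo_proj.weight": np.zeros((8, 16)),
}
pretrained_state = {"text_encoder.emb.weight": np.ones((40, 8))}
merged, kept_new = merge_pretrained(new_state, pretrained_state)
```

Reporting `kept_new` makes it visible which layers start untrained, which is useful when judging how long fine-tuning may need.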

Inference Example

See inference.ipynb or use MoeGoe


pan310/emotional-vits
