Skip to content

Commit

Permalink
Update README
Browse files Browse the repository at this point in the history
  • Loading branch information
ming024 committed Dec 8, 2020
1 parent 4c734f4 commit e223e72
Showing 1 changed file with 7 additions and 0 deletions.
7 changes: 7 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -37,6 +37,13 @@ Here is a generated spectrogram of the sentence "Printing, in the only sense wit
![](./synth/LJSpeech/step_300000.png)

For CPU inference please refer to this [colab tutorial](https://colab.research.google.com/drive/1S60pytpB1OcEFrd-SkYyjtBsBHYepRSG?usp=sharing). One has to clone the original repo of [MelGAN](https://github.com/seungwonpark/melgan) instead of using ``torch.hub`` due to the code architecture of MelGAN.
## Controllability
The duration/pitch/energy of the synthesized utterances can be modified by specifying the desired duration/pitch/energy ratio to the predicted values.
For example, one can increase the speaking rate by 20 % and decrease the volume by 20 % by

```
python3 synthesis.py --step 300000 --duration_control 0.8 --energy_control 0.8
```

# Training

Expand Down

0 comments on commit e223e72

Please sign in to comment.