
Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech ✊

Python 94 16 Updated Oct 14, 2022

PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.

Python 292 47 Updated Aug 25, 2021

An unofficial PyTorch implementation of the audio LM VALL-E

Python 2,971 419 Updated May 10, 2023

NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis

Python 145 11 Updated Feb 11, 2023

RelGAN: Multi-Domain Image-to-Image Translation via Relative Attributes

Python 75 18 Updated Jan 27, 2020

Python Implementation of Visual Relative Attributes for Image Classification and Zero Shot Learning

Python 21 6 Updated Jun 14, 2018
HTML 1 Updated Oct 25, 2022

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 21,050 3,098 Updated Nov 24, 2024

Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch

Python 620 53 Updated Oct 1, 2024

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Jupyter Notebook 765 99 Updated Dec 3, 2024

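Continues training on the Biaobei (Databaker) dataset and improves the original FastSpeech2 model by adding prosody representations and a prosody prediction module, making Chinese pronunciation more vivid and rhythmic.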

Python 250 42 Updated Sep 10, 2023

A simple tool for easily using the Montreal Forced Aligner. Also provides alignments (TextGrid) retrieved from ESD.

Jupyter Notebook 44 4 Updated May 25, 2023

An unofficial PyTorch implementation of Mix-Phoneme-Bert

Python 39 7 Updated Jul 10, 2023
Python 1,398 182 Updated Feb 11, 2024

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Python 2,483 225 Updated Dec 9, 2024

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Python 1,292 101 Updated Sep 24, 2023