Orca0917

🚀

Focusing

유종문 Orca0917

🚀

Focusing

30 followers · 36 following

Gachon University
Seoul, South Korea
03:03 (UTC +09:00)
https://killerwhale0917.tistory.com/
https://orca0917.github.io/

Achievements

x2 x2

Achievements

x2 x2

Highlights

Developer Program Member
Pro

Organizations

Lists (1)

Sort

Speech Synthesis

A curated paper list about `speech-synthesis` or implementations.

16 repositories

Stars

Tomiinek / Multilingual_Text_to_Speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

Python 831 157 Updated Oct 10, 2023

kan-bayashi / ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

Jupyter Notebook 1,579 343 Updated Apr 22, 2024

3b1b / manim

Animation engine for explanatory math videos

Python 71,953 6,314 Updated Dec 13, 2024

shivammehta25 / Matcha-TTS

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Jupyter Notebook 768 99 Updated Dec 3, 2024

NVIDIA / waveglow

A Flow-based Generative Network for Speech Synthesis

Python 2,294 531 Updated Oct 19, 2023

jaywalnut310 / glow-tts

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

Python 668 151 Updated Jul 12, 2022

Orca0917 / Algorithm

Online Judge(BOJ, Codeforces), algorithm study

C++ 4 Updated Oct 20, 2024

Pseudo-Lab / data-engineering-for-everybody

DE4E: Data Engineering for Everybody by Pseudo-Lab

Jupyter Notebook 66 13 Updated Sep 2, 2024

NVIDIA / flowtron

Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer

Jupyter Notebook 893 177 Updated Jul 6, 2023

dmlguq456 / SepReformer

Official repository of SepReformer for speech separation

Python 153 14 Updated Nov 6, 2024

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 7,314 782 Updated Dec 14, 2024

lixucuhk / Channel-wise-Gated-Res2Net

Implementation of the paper: Channel-wise Gated Res2Net: Towards Robust Detection of Synthetic Speech Attacks (INTERSPEECH 2021)

Shell 29 5 Updated Jul 21, 2021

Res2Net / Res2Net-PretrainedModels

(ImageNet pretrained models) The official pytorch implemention of the TPAMI paper "Res2Net: A New Multi-scale Backbone Architecture"

Python 1,078 215 Updated Dec 8, 2022

lixucuhk / ASV-anti-spoofing-with-Res2Net

Implementation of the paper: Replay and Synthetic Speech Detection with Res2Net architecture (ICASSP 2021) https://arxiv.org/abs/2010.15006

Shell 75 15 Updated Oct 21, 2021

NVIDIA / mellotron

Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data

Jupyter Notebook 855 182 Updated Jul 22, 2023

Kyubyong / g2pK

g2pK: g2p module for Korean

Python 237 43 Updated Mar 1, 2022

mikezzb / lyrics-sync

A deep learning lyrics-to-audio alignment system, generating synchronized lyrics from a song and its lyrics

Jupyter Notebook 33 1 Updated Jun 5, 2023

dessa-oss / fake-voice-detection

Using temporal convolution to detect Audio Deepfakes

Python 352 87 Updated Nov 21, 2022

as-ideas / TransformerTTS

🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.

Python 1,135 226 Updated May 3, 2024

JaeYeopHan / Interview_Question_for_Beginner

👦 👧 Technical-Interview guidelines written for those who started studying programming. I wish you all the best. 👾

19,892 4,613 Updated Aug 9, 2024

keithito / tacotron

A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)

Python 2,959 957 Updated Jul 6, 2023

r9y9 / tacotron_pytorch

PyTorch implementation of Tacotron speech synthesis model.

Jupyter Notebook 309 79 Updated Jul 12, 2019

redwankarimsony / PCA-from-Scratch-in-Python

A simple implementation of Principal Component Analysis (PCA) visualized using Fashion MNIST Dataset. Thanks to https://github.com/zalandoresearch/fashion-mnist for making the dataset.

Jupyter Notebook 21 6 Updated Jan 5, 2021

NourozR / Reconstruction-and-Compression-of-Color-Images

Reconstruction and Compression of Color Images Using Principal Component Analysis (PCA) Algorithm

Python 34 9 Updated Jun 3, 2020

Shivank1006 / Image-compression-and-reconstruction-by-PCA

The python script show the image reconstructed using 200 principal components (out of 512).

Python 4 1 Updated Oct 27, 2019

zzw922cn / awesome-speech-recognition-speech-synthesis-papers

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

2,996 515 Updated Oct 19, 2023

BridgetteSong / ExpressiveTacotron

This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN, Non-attentive Tacotron, GST, VAE, GMVAE, and X-vectors for…

Python 74 12 Updated Sep 21, 2022

suno-ai / bark

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 36,420 4,283 Updated Aug 19, 2024

keonlee9420 / Expressive-FastSpeech2

PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.

Python 292 47 Updated Aug 25, 2021

HGU-DLLAB / Korean-FastSpeech2-Pytorch

Implementation of Korean FastSpeech2

Python 214 51 Updated Jan 29, 2023