Skip to content
View Orca0917's full-sized avatar
🚀
Focusing
🚀
Focusing

Organizations

@boostcampaitech3 @BOAZ-bigdata @boostcamp-AI-Tech-alumni

Block or report Orca0917

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

Python 831 157 Updated Oct 10, 2023

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

Jupyter Notebook 1,579 343 Updated Apr 22, 2024

Animation engine for explanatory math videos

Python 71,953 6,314 Updated Dec 13, 2024

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Jupyter Notebook 768 99 Updated Dec 3, 2024

A Flow-based Generative Network for Speech Synthesis

Python 2,294 531 Updated Oct 19, 2023

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

Python 668 151 Updated Jul 12, 2022

Online Judge(BOJ, Codeforces), algorithm study

C++ 4 Updated Oct 20, 2024

DE4E: Data Engineering for Everybody by Pseudo-Lab

Jupyter Notebook 66 13 Updated Sep 2, 2024

Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer

Jupyter Notebook 893 177 Updated Jul 6, 2023

Official repository of SepReformer for speech separation

Python 153 14 Updated Nov 6, 2024

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 7,314 782 Updated Dec 14, 2024

Implementation of the paper: Channel-wise Gated Res2Net: Towards Robust Detection of Synthetic Speech Attacks (INTERSPEECH 2021)

Shell 29 5 Updated Jul 21, 2021

(ImageNet pretrained models) The official pytorch implemention of the TPAMI paper "Res2Net: A New Multi-scale Backbone Architecture"

Python 1,078 215 Updated Dec 8, 2022

Implementation of the paper: Replay and Synthetic Speech Detection with Res2Net architecture (ICASSP 2021) https://arxiv.org/abs/2010.15006

Shell 75 15 Updated Oct 21, 2021

Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data

Jupyter Notebook 855 182 Updated Jul 22, 2023

g2pK: g2p module for Korean

Python 237 43 Updated Mar 1, 2022

A deep learning lyrics-to-audio alignment system, generating synchronized lyrics from a song and its lyrics

Jupyter Notebook 33 1 Updated Jun 5, 2023

Using temporal convolution to detect Audio Deepfakes

Python 352 87 Updated Nov 21, 2022

🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.

Python 1,135 226 Updated May 3, 2024

👦 👧 Technical-Interview guidelines written for those who started studying programming. I wish you all the best. 👾

19,892 4,613 Updated Aug 9, 2024

A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)

Python 2,959 957 Updated Jul 6, 2023

PyTorch implementation of Tacotron speech synthesis model.

Jupyter Notebook 309 79 Updated Jul 12, 2019

A simple implementation of Principal Component Analysis (PCA) visualized using Fashion MNIST Dataset. Thanks to https://github.com/zalandoresearch/fashion-mnist for making the dataset.

Jupyter Notebook 21 6 Updated Jan 5, 2021

Reconstruction and Compression of Color Images Using Principal Component Analysis (PCA) Algorithm

Python 34 9 Updated Jun 3, 2020

The python script show the image reconstructed using 200 principal components (out of 512).

Python 4 1 Updated Oct 27, 2019

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

2,996 515 Updated Oct 19, 2023

This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN, Non-attentive Tacotron, GST, VAE, GMVAE, and X-vectors for…

Python 74 12 Updated Sep 21, 2022

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 36,420 4,283 Updated Aug 19, 2024

PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.

Python 292 47 Updated Aug 25, 2021

Implementation of Korean FastSpeech2

Python 214 51 Updated Jan 29, 2023
Next