JINWOO OH deeesp

Speech Synthesis (TTS), Voice Conversion (VC), Source Separation (music, speech enhancement, speech separation). M.S. in EECS

19 followers · 9 following

Humelo Inc.
Seoul
deeesp.github.io

Stars

yl4579 / StyleTTS-ZS

StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion

162 10 Updated Sep 27, 2024

affige / genmusic_demo_list

a list of demo websites for automatic music generation research

643 43 Updated Nov 20, 2024

haidog-yaqub / EzAudio

High-quality Text-to-Audio Generation with Efficient Diffusion Transformer

Python 250 9 Updated Nov 12, 2024

WangHelin1997 / SoloAudio

SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.

Python 71 6 Updated Nov 14, 2024

QwenLM / Qwen2-Audio

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,298 90 Updated Aug 13, 2024

jik876 / hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Python 1,990 510 Updated Jul 27, 2024

Rongjiehuang / GenerSpeech

PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.

Python 321 45 Updated Feb 9, 2024

facebookresearch / disentangling-correlated-factors

A benchmarking suite for disentanglement algorithms, suited for evaluating robustness to correlated factors. Codebase for the paper "Disentanglement of Correlated Factors via Hausdorff Factorized S…

Python 71 9 Updated Feb 25, 2023

ubisoft / ubisoft-laforge-disentanglement-metrics

Python 33 7 Updated Dec 18, 2020

yl4579 / StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 5,031 427 Updated Aug 10, 2024

lucidrains / BS-RoFormer

Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ByteDance AI Labs

Python 443 16 Updated Aug 6, 2024

kminito / srt_reservation

Python 64 60 Updated Oct 13, 2023

yl4579 / StyleTTS-VC

Official Implementation of StyleTTS-VC

Python 164 22 Updated Apr 23, 2023

KinWaiCheuk / demucs_lightning

Demucs Lightning: A PyTorch lightning version of Demucs with Hydra and Tensorboard features

Python 85 10 Updated May 3, 2023

facebookresearch / av_hubert

A self-supervised learning framework for audio-visual speech

Python 859 137 Updated Dec 7, 2023

Kyubyong / g2pK

g2pK: g2p module for Korean

Python 237 43 Updated Mar 1, 2022

Rongjiehuang / ProDiff

PyTorch Implementation of ProDiff (ACM-MM'22) with an Extremely-Fast diffusion speech synthesis pipeline

Python 434 55 Updated Apr 19, 2023

MoonInTheRiver / DiffSinger

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

Python 4,361 716 Updated May 2, 2023

hccho2 / Tacotron2-Wavenet-Korean-TTS

Korean TTS, Tacotron2, Wavenet

Python 165 96 Updated Jun 15, 2020

maum-ai / voicefilter

Unofficial PyTorch implementation of Google AI's VoiceFilter system

Python 1,103 228 Updated Jul 25, 2024

rishikksh20 / Avocodo-pytorch

Avocodo: Generative Adversarial Network for Artifact-free Vocoder

Python 115 15 Updated Jul 14, 2022

lmnt-com / diffwave

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Python 789 113 Updated Mar 26, 2024

Rongjiehuang / FastDiff

PyTorch Implementation of FastDiff (IJCAI'22)

Python 410 64 Updated Jun 20, 2024

jungwoo-ha / WeeklyArxivTalk

[Zoom & Facebook Live] Weekly AI Arxiv Season 2

972 41 Updated Aug 27, 2023

facebookresearch / demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Python 8,455 1,085 Updated Apr 24, 2024

jwkanggist / self-supervised-learning-narratives-1

Reading self-supervised learning backwards, Part 1

49 8 Updated Oct 30, 2022

knlee-voice / AI.Tech

Trends, Tools, News timeline ...

17 1 Updated Nov 4, 2024

anton-jeran / FAST-RIR

This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.

Python 155 28 Updated Jul 24, 2024

RookieJunChen / FullSubNet-plus

The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".

Python 247 56 Updated Apr 23, 2024

kan-bayashi / ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

Jupyter Notebook 1,579 343 Updated Apr 22, 2024
