dori2063

Youngdo Ahn dori2063

🏃‍♂️

8 followers · 22 following

GIST
Gwangju, Republic of Korea

Achievements

Highlights

Stars

dmlguq456 / SepReformer

Official repository of SepReformer for speech separation

Python 157 14 Updated Dec 18, 2024

Lhx94As / Awesome-Spoken-Language-Identification

An awesome spoken LID repository. (Working in progress

Python 97 10 Updated Apr 22, 2024

Takaaki-Saeki / ssl_speech_restoration_v2

Python 14 1 Updated Dec 18, 2023

abi / screenshot-to-code

Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

Python 66,026 8,025 Updated Dec 20, 2024

p0p4k / vits2_pytorch

unofficial vits2-TTS implementation in pytorch

Python 497 91 Updated Mar 28, 2024

tarepan / SpeechMOS

Easy-to-Use Speech MOS predictors

Python 240 16 Updated Oct 24, 2023

RanaCM / DSU-AVO

Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023

Python 11 1 Updated May 13, 2024

cure-lab / LTSF-Linear

[AAAI-23 Oral] Official implementation of the paper "Are Transformers Effective for Time Series Forecasting?"

Python 2,058 454 Updated Jan 27, 2024

DongKeon / EENDasP

Implementation of "End-to-End Speaker Diarization as Post-Processing"

Python 2 Updated May 24, 2023

anonymous-pits / pits

PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor

Python 277 34 Updated Jul 16, 2023

TaoRuijie / AVCleanse

ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'

Python 37 4 Updated Oct 31, 2022

caskcsg / SPCL

code for "Supervised Prototypical Contrastive Learning for Emotion Recognition in Conversation, EMNLP 22"

Python 76 8 Updated Feb 9, 2023

bytedance / uss

This is the PyTorch implementation of the Universal Source Separation with Weakly labelled Data.

Python 339 18 Updated Sep 1, 2023

butterfliesss / SDT

Python 49 9 Updated Jul 25, 2024

Takaaki-Saeki / zm-text-tts

[IJCAI'23] Learning to Speak from Text for Low-Resource TTS

Python 64 2 Updated May 30, 2023

DongKeon / Awesome-Speaker-Diarization

Some comprehensive papers about speaker diarization

237 5 Updated Nov 12, 2024

gist-ailab / block-selection-for-OOD-detection

This is an official implementation for "Block Selection Method for Using Feature Norm in Out-of-distribution Detection".

Python 22 2 Updated May 21, 2024

vb000 / Waveformer

A deep neural network architecture for low-latency audio processing

Python 291 34 Updated Aug 15, 2023

b04901014 / UUVC

Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Units.

Python 73 9 Updated Jan 7, 2023

GeWu-Lab / MMCosine_ICASSP23

The code repo for ICASSP 2023 Paper "MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning"

Python 18 1 Updated May 18, 2023

navervision / Graphit

Official Pytorch implementation of "Graphit: A Unified Framework for Diverse Image Editing Tasks"

Python 200 11 Updated May 1, 2023

zerohd4869 / MM-DFN

Source code for ICASSP 2022 paper "MM-DFN: Multimodal Dynamic Fusion Network For Emotion Recognition in Conversations"

Python 83 12 Updated Apr 21, 2023

maum-ai / phaseaug

ICASSP 2023 Accepted

Python 190 14 Updated May 6, 2024

winston-lin-wei-cheng / Chunk-Level-Emotion-Retrieval

The proposed framework to retrieve the continuous chunk-level emotions via emo-rankers for Seq2Seq SER

Python 2 Updated Aug 10, 2023

W-Wu / ERC-SLT22

Code for "Distribution-based Emotion Recognition in Conversation"

Python 19 1 Updated Feb 6, 2023

audeering / w2v2-how-to

How to use our public wav2vec2 dimensional emotion model

Jupyter Notebook 469 49 Updated May 22, 2023

unilight / s3prl-vc

S3PRL-VC: A Voice Conversion Toolkit based on S3PRL

Python 97 12 Updated Jun 26, 2024

zhang-tuo-pdf / FedAudio

[ICASSP 2023] FedAudio: A Federated Learning Benchmark for Audio and Speech Tasks

Python 46 1 Updated Feb 21, 2024

microsoft / WavText5K

Web-crawl for "Audio Retrieval with WavText5K and CLAP Training"

Python 49 Updated Nov 10, 2022

HappyColor / SpeechFormer

Official implement of SpeechFormer written in Python (PyTorch).

Python 76 8 Updated Apr 1, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Youngdo Ahn dori2063

Achievements

Achievements

Highlights

Block or report dori2063

Stars

dmlguq456 / SepReformer

Lhx94As / Awesome-Spoken-Language-Identification

Takaaki-Saeki / ssl_speech_restoration_v2

abi / screenshot-to-code

p0p4k / vits2_pytorch

tarepan / SpeechMOS

RanaCM / DSU-AVO

cure-lab / LTSF-Linear

DongKeon / EENDasP

anonymous-pits / pits

TaoRuijie / AVCleanse

caskcsg / SPCL

bytedance / uss

butterfliesss / SDT

Takaaki-Saeki / zm-text-tts

DongKeon / Awesome-Speaker-Diarization

gist-ailab / block-selection-for-OOD-detection

vb000 / Waveformer

b04901014 / UUVC

GeWu-Lab / MMCosine_ICASSP23

navervision / Graphit

zerohd4869 / MM-DFN

maum-ai / phaseaug

winston-lin-wei-cheng / Chunk-Level-Emotion-Retrieval

W-Wu / ERC-SLT22

audeering / w2v2-how-to

unilight / s3prl-vc

zhang-tuo-pdf / FedAudio

microsoft / WavText5K

HappyColor / SpeechFormer