  • GIST
  • Gwangju, Republic of Korea


Official repository of SepReformer for speech separation

Python 157 14 Updated Dec 18, 2024

An awesome spoken LID repository. (Work in progress)

Python 97 10 Updated Apr 22, 2024

Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

Python 66,026 8,025 Updated Dec 20, 2024

Unofficial vits2-TTS implementation in PyTorch

Python 497 91 Updated Mar 28, 2024

Easy-to-Use Speech MOS predictors

Python 240 16 Updated Oct 24, 2023

Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023

Python 11 1 Updated May 13, 2024

[AAAI-23 Oral] Official implementation of the paper "Are Transformers Effective for Time Series Forecasting?"

Python 2,058 454 Updated Jan 27, 2024

Implementation of "End-to-End Speaker Diarization as Post-Processing"

Python 2 Updated May 24, 2023

PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor

Python 277 34 Updated Jul 16, 2023

ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'

Python 37 4 Updated Oct 31, 2022

Code for "Supervised Prototypical Contrastive Learning for Emotion Recognition in Conversation, EMNLP 22"

Python 76 8 Updated Feb 9, 2023

This is the PyTorch implementation of the Universal Source Separation with Weakly labelled Data.

Python 339 18 Updated Sep 1, 2023
Python 49 9 Updated Jul 25, 2024

[IJCAI'23] Learning to Speak from Text for Low-Resource TTS

Python 64 2 Updated May 30, 2023

Some comprehensive papers about speaker diarization

237 5 Updated Nov 12, 2024

This is an official implementation for "Block Selection Method for Using Feature Norm in Out-of-distribution Detection".

Python 22 2 Updated May 21, 2024

A deep neural network architecture for low-latency audio processing

Python 291 34 Updated Aug 15, 2023

Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Units.

Python 73 9 Updated Jan 7, 2023

The code repo for ICASSP 2023 Paper "MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning"

Python 18 1 Updated May 18, 2023

Official PyTorch implementation of "Graphit: A Unified Framework for Diverse Image Editing Tasks"

Python 200 11 Updated May 1, 2023

Source code for ICASSP 2022 paper "MM-DFN: Multimodal Dynamic Fusion Network For Emotion Recognition in Conversations"

Python 83 12 Updated Apr 21, 2023

ICASSP 2023 Accepted

Python 190 14 Updated May 6, 2024

The proposed framework to retrieve the continuous chunk-level emotions via emo-rankers for Seq2Seq SER

Python 2 Updated Aug 10, 2023

Code for "Distribution-based Emotion Recognition in Conversation"

Python 19 1 Updated Feb 6, 2023

How to use our public wav2vec2 dimensional emotion model

Jupyter Notebook 469 49 Updated May 22, 2023

S3PRL-VC: A Voice Conversion Toolkit based on S3PRL

Python 97 12 Updated Jun 26, 2024

[ICASSP 2023] FedAudio: A Federated Learning Benchmark for Audio and Speech Tasks

Python 46 1 Updated Feb 21, 2024

Web-crawl for "Audio Retrieval with WavText5K and CLAP Training"

Python 49 Updated Nov 10, 2022

Official implementation of SpeechFormer written in Python (PyTorch).

Python 76 8 Updated Apr 1, 2023