AndongLi Andong-Li-speech

🎯

Focusing

Institute of Acoustics, Chinese Academy of Sciences (IACAS).

354 followers · 140 following

Beijing, China
UTC +08:00
https://andong-li-speech.github.io

Stars

wl-zhao / UniPC

[NeurIPS 2023] UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models

Jupyter Notebook · 310 stars · 13 forks · Updated Sep 22, 2023

Audio-WestlakeU / RealMAN

A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurIPS 2024]

Python · 109 stars · 11 forks · Updated Dec 11, 2024

zsyOAOA / ResShift

ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting (NeurIPS@2023 Spotlight, TPAMI@2024)

Python · 1,027 stars · 54 forks · Updated Dec 31, 2024

resemble-ai / resemble-enhance

AI powered speech denoising and enhancement

Python · 1,572 stars · 170 forks · Updated Dec 3, 2024

audiolabs / torch-pesq

PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio

Python · 161 stars · 15 forks · Updated Jul 14, 2023

dongzhuoyao / awesome-flow-matching

A summary of related work on flow matching and stochastic interpolants

366 stars · 13 forks · Updated Jul 29, 2024

X-LANCE / VoiceFlow-TTS

[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"

Python · 323 stars · 21 forks · Updated Sep 3, 2024

felixperfler / Stable-Hybrid-Auditory-Filterbanks

[Interspeech 2024] Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement

Python · 34 stars · 1 fork · Updated Dec 2, 2024

qiuk2 / AAR

[Official Implementation] Acoustic Autoregressive Modeling 🔥

Python · 59 stars · 5 forks · Updated Aug 24, 2024

sh-lee-prml / PeriodWave

The official implementation of PeriodWave and PeriodWave-Turbo

144 stars · 7 forks · Updated Dec 17, 2024

ShenranTomWang / Matcha-TTS

Python · 6 stars · 1 fork · Updated Nov 19, 2024

wetdog / wavenext_pytorch

Unofficial implementation of the WaveNeXt vocoder

Python · 39 stars · 5 forks · Updated Aug 28, 2024

LqNoob / vocos

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Python · 6 stars · Updated May 30, 2024

wavmark / wavmark

AI-based Audio Watermarking Tool

Python · 239 stars · 32 forks · Updated Jan 7, 2024

EIHW / u-dit-tts

HTML · 8 stars · 1 fork · Updated Sep 18, 2023

lucidrains / voicebox-pytorch

Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch

Python · 625 stars · 53 forks · Updated Oct 1, 2024

Xiaobin-Rong / gtcrn

The official implementation of GTCRN, an ultra-lite speech enhancement model.

Python · 247 stars · 43 forks · Updated Jan 1, 2025

csteinmetz1 / pyloudnorm

Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm

Python · 671 stars · 58 forks · Updated Jul 2, 2024

haoheliu / SemantiCodec-inference

Ultra-low-bitrate neural audio codec (0.31~1.40 kbps) with better semantics in the latent space.

Python · 180 stars · 11 forks · Updated Aug 25, 2024

zelokuo / VPIDM

Official repository of a new SOTA diffusion-model-based method for speech enhancement

Python · 34 stars · 8 forks · Updated Jul 31, 2024

sp-uhh / ears_benchmark

Generation scripts for EARS-WHAM and EARS-Reverb

Python · 27 stars · 3 forks · Updated Sep 16, 2024

Emrys365 / DNS_text

Transcripts of the DNS Challenge test sets

6 stars · Updated Jul 7, 2023

urgent-challenge / urgent2024_challenge

Official data preparation scripts for the URGENT 2024 Challenge

Python · 75 stars · 5 forks · Updated Jan 9, 2025

BakerBunker / FreeV

[InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter

Python · 86 stars · 7 forks · Updated Jul 4, 2024

WangHelin1997 / Fast-GeCo

Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction

Python · 37 stars · Updated Nov 19, 2024

facebookresearch / ears_dataset

Expressive Anechoic Recordings of Speech (EARS)

Python · 140 stars · 7 forks · Updated Jun 25, 2024

Emrys365 / se-scaling

Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enhancement"

Python · 32 stars · 3 forks · Updated Aug 7, 2024

jacobgil / pytorch-tensor-decompositions

PyTorch implementation of [1412.6553] and [1511.06530] tensor decomposition methods for convolutional layers.

Python · 279 stars · 63 forks · Updated Dec 1, 2021

SJTU-DeepVisionLab / FLoRA

Python · 33 stars · 1 fork · Updated Jul 22, 2024

vb000 / SemanticHearing

Real-time binaural target sound extraction model.

Python · 78 stars · 13 forks · Updated Mar 28, 2024
