Andong-Li-speech

[NeurIPS 2023] UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models

Jupyter Notebook 310 13 Updated Sep 22, 2023

A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurIPS 2024]

Python 109 11 Updated Dec 11, 2024

ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting (NeurIPS@2023 Spotlight, TPAMI@2024)

Python 1,027 54 Updated Dec 31, 2024

AI powered speech denoising and enhancement

Python 1,572 170 Updated Dec 3, 2024

PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio

Python 161 15 Updated Jul 14, 2023

A summary of related works about flow matching, stochastic interpolants

366 13 Updated Jul 29, 2024

[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"

Python 323 21 Updated Sep 3, 2024

[Interspeech 2024] Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement

Python 34 1 Updated Dec 2, 2024

[Official Implementation] Acoustic Autoregressive Modeling 🔥

Python 59 5 Updated Aug 24, 2024

The official implementation of PeriodWave and PeriodWave-Turbo

144 7 Updated Dec 17, 2024
Python 6 1 Updated Nov 19, 2024

Unofficial implementation of the WaveNeXt vocoder

Python 39 5 Updated Aug 28, 2024

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Python 6 Updated May 30, 2024

AI-based Audio Watermarking Tool

Python 239 32 Updated Jan 7, 2024
HTML 8 1 Updated Sep 18, 2023

Implementation of Voicebox, a new SOTA text-to-speech network from MetaAI, in PyTorch

Python 625 53 Updated Oct 1, 2024

The official implementation of GTCRN, an ultra-lite speech enhancement model.

Python 247 43 Updated Jan 1, 2025

Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm

Python 671 58 Updated Jul 2, 2024

Ultra-low-bitrate neural audio codec (0.31~1.40 kbps) with better semantics in the latent space.

Python 180 11 Updated Aug 25, 2024

This is the official repository of a new SOTA diffusion-model-based method for speech enhancement

Python 34 8 Updated Jul 31, 2024

Generation scripts for EARS-WHAM and EARS-Reverb

Python 27 3 Updated Sep 16, 2024

Transcripts of the DNS Challenge test sets

6 Updated Jul 7, 2023

Official data preparation scripts for the URGENT 2024 Challenge

Python 75 5 Updated Jan 9, 2025

[InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter

Python 86 7 Updated Jul 4, 2024

Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction

Python 37 Updated Nov 19, 2024

Expressive Anechoic Recordings of Speech (EARS)

Python 140 7 Updated Jun 25, 2024

Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enhancement"

Python 32 3 Updated Aug 7, 2024

PyTorch implementation of [1412.6553] and [1511.06530] tensor decomposition methods for convolutional layers.

Python 279 63 Updated Dec 1, 2021
Python 33 1 Updated Jul 22, 2024

Real-time binaural target sound extraction model.

Python 78 13 Updated Mar 28, 2024