Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation, clipping, equalization (EQ) distortion, packet loss, codec…

Python 42 4 Updated Jul 29, 2024

CorentinJ / librispeech-alignments

Word alignments generated by the Montreal Forced Aligner for the Librispeech dataset

Python 155 23 Updated Mar 25, 2019

sp-uhh / sgmse_crp

Python 21 2 Updated Jan 9, 2024

sp-uhh / storm

StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation

Python 200 26 Updated Sep 13, 2024

neillu23 / CDiffuSE

Conditional Diffusion Probabilistic Model for Speech Enhancement

Python 221 34 Updated Dec 20, 2022

sp-uhh / sgmse

Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation

Python 552 77 Updated Dec 30, 2024

openai / improved-diffusion

Release for Improved Denoising Diffusion Probabilistic Models

Python 3,376 496 Updated Jul 18, 2024

philsyn / DiffWave-unconditional

PyTorch reimplementation of DiffWave unconditional generation: a high-quality waveform synthesizer.

Python 35 5 Updated Apr 13, 2021

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Python 9,151 1,415 Updated Jan 4, 2025

shikiw / Modality-Integration-Rate

The official code of the paper "Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate".

Python 92 3 Updated Nov 27, 2024

gabrielmittag / NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

Python 714 119 Updated Dec 1, 2024

abus-aikorea / voice-pro

Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube dow…

Python 2,474 186 Updated Dec 22, 2024

ShengjieJin / clash-for-linux-without-sudo

Forked from Elegycloud/clash-for-linux-backup

Use clash on Linux without sudo privileges

Shell 54 6 Updated Nov 14, 2024

hopkin-ghp / speech-enhancement-paper

Speech enhancement papers: denoising, dereverberation, etc.

13 2 Updated Jun 21, 2024

deep-privacy / SA-toolkit

SA-toolkit: Speaker speech anonymization toolkit in Python

Python 19 1 Updated Jan 7, 2025

Tinglok / avsoundscape

8 Updated Sep 24, 2024

facebookresearch / encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,556 311 Updated Jan 4, 2024

sz3 / libcimbar

Optimized implementation for color-icon-matrix barcodes

C++ 4,593 323 Updated Dec 9, 2024

pyking / security_w1k1

Forked from euphrat1ca/Security-List

collect

45 17 Updated May 6, 2020

Fan Wei cyberrrange

Stars

LoCryptEn / Key-security

moatifbutt / awesome-diffusion-iclr-2025

RVC-Project / Retrieval-based-Voice-Conversion-WebUI

Plachtaa / seed-vc

bfs18 / e2_tts

Cazure8 / voiceguard-subnet

nanless / universal-speech-enhancement