AveIZTZ

Follow

Yujie Zhu AveIZTZ

Follow

2 followers · 9 following

Stars

BrownsugarZeer / Multi_SSL

Combine sound source separation with SRP-PHAT to achieve multi-source localization.

Python 61 11 Updated Jan 22, 2025

idiap / nnsslm

Neural Network based Sound Source Localization Models

Python 34 9 Updated Aug 29, 2023

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Python 9,271 1,424 Updated Jan 22, 2025

WenzheLiu-Speech / awesome-speech-enhancement

speech enhancement\speech seperation\sound source localization

1,082 225 Updated Nov 14, 2023

echocatzh / MTFAA-Net

Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement

Python 199 58 Updated Sep 30, 2022

espnet / espnet

End-to-End Speech Processing Toolkit

Python 8,719 2,209 Updated Jan 28, 2025

philgzl / brever

Speech enhancement in noisy and reverberant environments using deep neural networks

Python 20 4 Updated Oct 7, 2024

modelscope / ClearerVoice-Studio

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 2,126 151 Updated Jan 27, 2025

alibabasglab / D2Former

This repository contains the audio samples for "D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network using Joint Complex Masking and Complex Spectral Mapping for Monaural Speech Enhan…

Python 37 6 Updated Sep 6, 2023

yxlu-0102 / MP-SENet

Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement

Python 353 52 Updated Oct 28, 2024

cszheng-ioa / Sixty-years-of-frequency-domain-monaural-speech-enhancement

Python 135 27 Updated Jan 30, 2024

taylorlu / Speaker-Diarization

speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition

Python 476 120 Updated Jul 1, 2021

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 6,743 814 Updated Jan 27, 2025

MahmoudAshraf97 / whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Jupyter Notebook 4,056 363 Updated Dec 18, 2024

wenet-e2e / wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Python 812 126 Updated Jan 6, 2025

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 75,213 8,992 Updated Jan 4, 2025

salman18376 / SE-SSL

Python 8 Updated Oct 2, 2024

RookieJunChen / FullSubNet-plus

The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".

Python 251 56 Updated Apr 23, 2024

PoKoHA / Speech_Enhancement-DCCRN

DCCRN: Deep Complex Convolution Recurrent Network

Python 6 2 Updated Nov 26, 2021

asteroid-team / asteroid

The PyTorch-based audio source separation toolkit for researchers

Python 2,314 427 Updated Jan 11, 2025

introlab / uimvdr

Python 8 Updated Oct 11, 2024

felixfuyihui / Uformer

Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation

Python 100 16 Updated Jun 29, 2022

Le-Xiaohuai-speech / DPCRN_DNS3

Implementation of paper "DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement"

Python 194 44 Updated Apr 22, 2024

bobondemon / online-offline-CGMM-for-MVDR

Offline CGMM and CGMM with spatial prior distribution in an online manner

Python 18 9 Updated Apr 19, 2019

AkojimaSLP / Neural-mask-estimation

Python 39 9 Updated Dec 5, 2019

FrancoisGrondin / mvdrpf

Python 4 4 Updated May 21, 2024

AkojimaSLP / Frame-by-frame-closed-form-update-for-mask-based-adaptive-MVDR-beamforming

speech-enhacement

Python 50 17 Updated Nov 5, 2019

funcwj / CGMM-MVDR

Implementation of the CGMM-MVDR beamforming (for python version please refer to https://github.com/funcwj/setk)

Python 146 55 Updated Aug 12, 2020

Rikorose / DeepFilterNet

Noise supression using deep filtering

Python 2,694 249 Updated Oct 17, 2024

levidaniel96 / peerRTF

robust RTFs by GCN

Python 4 1 Updated Aug 31, 2024