Skip to content
View AveIZTZ's full-sized avatar

Block or report AveIZTZ

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Combine sound source separation with SRP-PHAT to achieve multi-source localization.

Python 61 11 Updated Jan 22, 2025

Neural Network based Sound Source Localization Models

Python 34 9 Updated Aug 29, 2023

A PyTorch-based Speech Toolkit

Python 9,271 1,424 Updated Jan 22, 2025

speech enhancement\speech seperation\sound source localization

1,082 225 Updated Nov 14, 2023

Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement

Python 199 58 Updated Sep 30, 2022

End-to-End Speech Processing Toolkit

Python 8,719 2,209 Updated Jan 28, 2025

Speech enhancement in noisy and reverberant environments using deep neural networks

Python 20 4 Updated Oct 7, 2024

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 2,126 151 Updated Jan 27, 2025

This repository contains the audio samples for "D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network using Joint Complex Masking and Complex Spectral Mapping for Monaural Speech Enhan…

Python 37 6 Updated Sep 6, 2023

Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement

Python 353 52 Updated Oct 28, 2024

speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition

Python 476 120 Updated Jul 1, 2021

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 6,743 814 Updated Jan 27, 2025

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Jupyter Notebook 4,056 363 Updated Dec 18, 2024

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Python 812 126 Updated Jan 6, 2025

Robust Speech Recognition via Large-Scale Weak Supervision

Python 75,213 8,992 Updated Jan 4, 2025
Python 8 Updated Oct 2, 2024

The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".

Python 251 56 Updated Apr 23, 2024

DCCRN: Deep Complex Convolution Recurrent Network

Python 6 2 Updated Nov 26, 2021

The PyTorch-based audio source separation toolkit for researchers

Python 2,314 427 Updated Jan 11, 2025
Python 8 Updated Oct 11, 2024

Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation

Python 100 16 Updated Jun 29, 2022

Implementation of paper "DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement"

Python 194 44 Updated Apr 22, 2024

Offline CGMM and CGMM with spatial prior distribution in an online manner

Python 18 9 Updated Apr 19, 2019
Python 4 4 Updated May 21, 2024

Implementation of the CGMM-MVDR beamforming (for python version please refer to https://github.com/funcwj/setk)

Python 146 55 Updated Aug 12, 2020

Noise supression using deep filtering

Python 2,694 249 Updated Oct 17, 2024

robust RTFs by GCN

Python 4 1 Updated Aug 31, 2024
Next