Skip to content
View xiaocui-iii's full-sized avatar

Block or report xiaocui-iii

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A vue-based project page template for academic papers. (in development) https://junyaohu.github.io/academic-project-page-template-vue

Vue 228 13 Updated Jan 7, 2025

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 2,258 164 Updated Feb 14, 2025

The official implementation of GTCRN, an ultra-lite speech enhancement model.

Python 258 47 Updated Jan 1, 2025

A training code template for DNN-based speech enhancement.

Python 67 25 Updated Feb 10, 2025

An example of a speech enhancement model deployed with TensorRT.

Python 45 7 Updated Jan 10, 2024

Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.

Python 86 10 Updated Aug 29, 2024

Audio waveform player

TypeScript 9,111 1,668 Updated Feb 13, 2025

Only implemented through torch: "bi - mamba2" , "vision- mamba2 -torch". support 1d/2d/3d/nd and support export by jit.script/onnx;

Python 264 11 Updated Dec 11, 2024

Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction

Python 66 5 Updated Feb 8, 2025

Python implementation of performance metrics in Loizou's Speech Enhancement book

Python 405 88 Updated Feb 15, 2025
Python 1,015 312 Updated Feb 4, 2025

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 8,281 861 Updated Feb 18, 2025

Perceptual Quality Estimator for speech and audio

C++ 739 130 Updated Aug 2, 2024

PyTorch implementation of the Flash Spectral Transform Unit.

Python 15 3 Updated Sep 19, 2024

Expressive Anechoic Recordings of Speech (EARS)

Python 148 7 Updated Jun 25, 2024

The official project website of "Omni-Dimensional Dynamic Convolution" (ODConv for short, spotlight in ICLR 2022).

Python 305 27 Updated Sep 6, 2023

Phase-Aware Speech Enhancement with Deep Complex U-Net

Python 83 23 Updated Nov 4, 2019

Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enhancemen

Python 32 5 Updated Jul 21, 2024

语音方向实验室/公司/资源/实习等,欢迎推荐或自荐

543 68 Updated Nov 13, 2024

Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation

Python 576 77 Updated Jan 27, 2025

Convolutional recurrent network in pytorch

Python 2,424 656 Updated Sep 19, 2024

Visualizer for neural network, deep learning and machine learning models

JavaScript 29,416 2,851 Updated Feb 20, 2025

End-to-End Speech Processing Toolkit

Python 8,793 2,221 Updated Feb 5, 2025

The PyTorch-based audio source separation toolkit for researchers

Python 2,325 426 Updated Jan 11, 2025

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

14,221 1,423 Updated Feb 13, 2023

Latex code for making neural networks diagrams

TeX 22,749 2,920 Updated Aug 21, 2023

《Pytorch实用教程》(第二版)无论是零基础入门,还是CV、NLP、LLM项目应用,或是进阶工程化部署落地,在这里都有。相信在本书的帮助下,读者将能够轻松掌握 PyTorch 的使用,成为一名优秀的深度学习工程师。

Jupyter Notebook 3,137 349 Updated Jan 27, 2025

Code for the book Deep Learning with PyTorch by Eli Stevens, Luca Antiga, and Thomas Viehmann.

Jupyter Notebook 4,858 2,049 Updated Jul 25, 2024

Conditional Diffusion Probabilistic Model for Speech Enhancement

Python 224 35 Updated Dec 20, 2022
Next