Stars
A vue-based project page template for academic papers. (in development) https://junyaohu.github.io/academic-project-page-template-vue
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
The official implementation of GTCRN, an ultra-lite speech enhancement model.
A training code template for DNN-based speech enhancement.
An example of a speech enhancement model deployed with TensorRT.
Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.
Only implemented through torch: "bi - mamba2" , "vision- mamba2 -torch". support 1d/2d/3d/nd and support export by jit.script/onnx;
Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction
Python implementation of performance metrics in Loizou's Speech Enhancement book
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
PyTorch implementation of the Flash Spectral Transform Unit.
Expressive Anechoic Recordings of Speech (EARS)
The official project website of "Omni-Dimensional Dynamic Convolution" (ODConv for short, spotlight in ICLR 2022).
Phase-Aware Speech Enhancement with Deep Complex U-Net
Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enhancemen
Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation
Convolutional recurrent network in pytorch
Visualizer for neural network, deep learning and machine learning models
The PyTorch-based audio source separation toolkit for researchers
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
Latex code for making neural networks diagrams
《Pytorch实用教程》(第二版)无论是零基础入门,还是CV、NLP、LLM项目应用,或是进阶工程化部署落地,在这里都有。相信在本书的帮助下,读者将能够轻松掌握 PyTorch 的使用,成为一名优秀的深度学习工程师。
Code for the book Deep Learning with PyTorch by Eli Stevens, Luca Antiga, and Thomas Viehmann.
Conditional Diffusion Probabilistic Model for Speech Enhancement