Stars
An implementation of SpecAugment, introduced by Google Brain, with TensorFlow & PyTorch
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
[ICCV 2023] MADAug: When to Learn What: Model-Adaptive Data Augmentation Curriculum
Code of "Deep invariant networks with differentiable augmentation layers"
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
Official PyTorch repository for Dual Quaternion Ambisonics Array for Six-Degree-of-Freedom Acoustic Representation
A Python implementation of “SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization” [ICASSP 2022]
A Python implementation of “Learning Deep Direct-Path Relative Transfer Function for Binaural Sound Source Localization” [TASLP 2021]
This is the public repository for eigenvector-based SALSA features for polyphonic sound event localization and detection.
Gammatone-based spectrograms, using gammatone filterbanks or Fourier transform weightings.
A PyTorch implementation of Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation
Production First and Production Ready End-to-End Speech Recognition Toolkit
An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection
Our DCASE 2019 challenge task 3 method
Baseline method for sound event localization task of DCASE 2022 challenge
This is an open-source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with PyTorch, the well-known deep learning toolkit.
You can find the speech algorithms you want here
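The code for the book 《语音信号处理试验教程》 (Speech Signal Processing Experiment Tutorial, by Liang Ruiyu et al.) is mainly implemented in MATLAB. Since Python is now more popular, this project reimplements most of it in Python.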
General purpose sound recognition demo
Learn full-stack django for free! A personal blog based on django 2.2, and a django tutorial that beginners should not miss! 。◕ᴗ◕。
Some interesting Python crawlers and data analysis projects