kiwi1kkkkk

Stars

nanahou / Awesome-Speech-Enhancement

A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.

MATLAB 737 150 Updated Dec 1, 2020

maggie0830 / DCCRN

PyTorch implementation of "DCCRN: Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement"

Python 184 32 Updated Oct 8, 2020

BUTSpeechFIT / speakerbeam

Jupyter Notebook 109 18 Updated Oct 25, 2021

kaituoxu / Conv-TasNet

A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).

Python 690 156 Updated Apr 6, 2023

TaoRuijie / ECAPA-TDNN

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER = 0.86 on Vox1_O when trained only on Vox2)

Python 630 115 Updated Apr 11, 2024

ethanjperez / film

Forked from facebookresearch/clevr-iep

FiLM: Visual Reasoning with a General Conditioning Layer

Python 327 54 Updated Jan 11, 2022

oucxlw / asa

Attention-based scaling adaptation for target speech extraction

Python 1 Updated Oct 22, 2020

JusperLee / Conv-TasNet

A PyTorch implementation of "Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation"

Python 448 77 Updated May 26, 2023

facebookresearch / demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Python 8,593 1,113 Updated Apr 24, 2024

youngyangyang04 / leetcode-master

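《代码随想录》 LeetCode practice guide: a recommended order for working through 200 classic problems, about 600,000 characters of detailed illustrated explanations, video walkthroughs of the difficult points, more than 50 mind maps, and versions in C++, Java, Python, Go, JavaScript, and other languages, so that learning algorithms is no longer confusing! 🔥🔥 Take a look, and you will wish you had found it sooner! 🚀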

Shell 53,535 11,745 Updated Jan 27, 2025

jsalt2020-asrdiar / jsalt2020_simulate

Training data simulation

Python 46 7 Updated May 6, 2024

dmort27 / epitran

A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)

Python 673 124 Updated Jan 23, 2025

lingjzhu / CharsiuG2P

Multilingual G2P in 100 languages

Jupyter Notebook 296 24 Updated May 26, 2023

LetheSec / HuggingFace-Download-Accelerator

High-speed downloads from a mirror site using HuggingFace's official download tool.

Python 937 84 Updated Oct 12, 2024

lingjzhu / clap-ipa

Keyword spotting and forced alignment in any language

Python 48 3 Updated Jun 29, 2024

QingruZhang / AdaLoRA

AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning (ICLR 2023).

Python 289 27 Updated Jun 1, 2023

microsoft / LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 11,193 706 Updated Dec 17, 2024

yeyupiaoling / Whisper-Finetune

Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deplo…

C 941 155 Updated Jan 22, 2025

huggingface / peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 17,108 1,709 Updated Jan 29, 2025

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 23,196 2,283 Updated Jan 22, 2025

nachifur / RDDM

CVPR 2024: Residual Denoising Diffusion Models

Python 441 40 Updated Jan 11, 2025

dome272 / Diffusion-Models-pytorch

PyTorch implementation of Diffusion Models (https://arxiv.org/pdf/2006.11239.pdf)

Python 1,237 283 Updated Sep 7, 2023

amusi / CVPR2024-Papers-with-Code

A collection of CVPR 2024 papers and open-source projects

18,749 2,622 Updated Jul 4, 2024

dobby-seo / Wav2Keyword

Wav2Keyword is a keyword spotting (KWS) model based on Wav2Vec 2.0. It reports state-of-the-art results on the Speech Commands datasets V1 and V2.

Python 103 29 Updated Jan 11, 2023

lilianemomeni / KWS-Net

Seeing Wake Words: Audio-visual Keyword Spotting

Python 64 12 Updated Sep 16, 2020

AILab-CVC / UniRepLKNet

[CVPR'24] UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition

Python 966 57 Updated Oct 24, 2024

LAION-AI / CLAP

Contrastive Language-Audio Pretraining

Python 1,514 149 Updated Nov 21, 2024

openai / CLIP

CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image

Jupyter Notebook 27,147 3,416 Updated Jul 23, 2024

HolgerBovbjerg / data2vec-KWS

This repository contains code for applying Data2Vec to pretrain a Keyword Transformer model, as described in "Improving Label-Deficient Keyword Spotting Through Self-Supervised Pretraining".

Python 27 5 Updated Jan 4, 2025

sovrasov / flops-counter.pytorch

FLOPs counter for convolutional networks in the PyTorch framework

Python 2,854 307 Updated Jan 20, 2025
