List of diffusion related active submissions on OpenReview for ICLR 2025.

4 Updated Oct 27, 2024

Easily train a good VC model with voice data <= 10 mins!

Python 25,849 3,745 Updated Nov 24, 2024

zero-shot voice conversion & singing voice conversion, with real-time support

Python 857 105 Updated Jan 4, 2025
Python 63 7 Updated Sep 3, 2024

Bittensor's Voice Guard subnet, functioning as an anti-voice-deepfake measure.

Python 6 2 Updated Dec 26, 2024

Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation, clipping, equalization (EQ) distortion, packet loss, codec…

Python 42 4 Updated Jul 29, 2024

Word alignments generated by the Montreal Forced Aligner for the Librispeech dataset

Python 155 23 Updated Mar 25, 2019
Python 21 2 Updated Jan 9, 2024

StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation

Python 200 26 Updated Sep 13, 2024

Conditional Diffusion Probabilistic Model for Speech Enhancement

Python 221 34 Updated Dec 20, 2022

Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation

Python 552 77 Updated Dec 30, 2024

Release for Improved Denoising Diffusion Probabilistic Models

Python 3,376 496 Updated Jul 18, 2024

PyTorch reimplementation of DiffWave unconditional generation: a high-quality waveform synthesizer.

Python 35 5 Updated Apr 13, 2021

A PyTorch-based Speech Toolkit

Python 9,151 1,415 Updated Jan 4, 2025

The official code of the paper "Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate".

Python 92 3 Updated Nov 27, 2024

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

Python 714 119 Updated Dec 1, 2024

Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube dow…

Python 2,474 186 Updated Dec 22, 2024

Use Clash on Linux without sudo privileges

Shell 54 6 Updated Nov 14, 2024

Speech enhancement papers: denoising, dereverberation, etc.

13 2 Updated Jun 21, 2024

SA-toolkit: Speaker speech anonymization toolkit in python

Python 19 1 Updated Jan 7, 2025
8 Updated Sep 24, 2024

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,556 311 Updated Jan 4, 2024

Optimized implementation for color-icon-matrix barcodes

C++ 4,593 323 Updated Dec 9, 2024

collect

45 17 Updated May 6, 2020

Remote Sensing Image Classification Dataset for Aircraft Fine-Grained Recognition

10 Updated Dec 13, 2024

Official repository for "Speaking Style Conversion With Discrete Self-Supervised Units" (EMNLP 2023). https://arxiv.org/abs/2212.09730

Python 128 9 Updated Dec 8, 2023

A curated list of awesome audio adversarial examples papers (with code & demo if available).

31 5 Updated Apr 26, 2020

[NeurIPS 2024 spotlight] Official implementation of MSFA and release of SARDet_100K dataset for Large-Scale Synthetic Aperture Radar (SAR) Object Detection

Python 428 28 Updated Oct 29, 2024

Awesome-LLM-Tabular: a curated list of Large Language Models applied to Tabular Data

344 28 Updated Dec 22, 2024