Skip to content
View BrightGu's full-sized avatar
  • 安徽合肥

Block or report BrightGu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Robust Speech Recognition via Large-Scale Weak Supervision

Python 72,803 8,683 Updated Dec 1, 2024

GUI for a Vocal Remover that uses Deep Neural Networks.

Python 18,639 1,387 Updated Dec 9, 2024

The official Python API for ElevenLabs Text to Speech.

Python 2,260 264 Updated Dec 14, 2024

vits2 backbone with multilingual-bert

Python 8,076 1,145 Updated Dec 9, 2024

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 36,415 4,282 Updated Aug 19, 2024

Voice Conversion With Just Nearest Neighbors

Python 462 68 Updated Mar 18, 2024

Easily train a good VC model with voice data <= 10 mins!

Python 25,213 3,686 Updated Nov 24, 2024

リアルタイムボイスチェンジャー Realtime Voice Changer

Python 16,714 1,819 Updated Nov 14, 2024

Official Implementation of FreeDrag (CVPR 2024)

Python 413 20 Updated May 6, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 36,084 4,419 Updated Aug 16, 2024

Singing Voice Conversion via diffusion model

Jupyter Notebook 2,650 806 Updated Jul 10, 2023

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 6,951 1,270 Updated Dec 6, 2023

The official gpt4free repository | various collection of powerful language models

Python 62,685 13,432 Updated Dec 14, 2024

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

Python 4,362 716 Updated May 2, 2023

SoftVC VITS Singing Voice Conversion

Python 26,115 4,860 Updated Nov 11, 2023

Official PyTorch Code and Models of "RePaint: Inpainting using Denoising Diffusion Probabilistic Models", CVPR 2022

Python 2,004 159 Updated Aug 20, 2022

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 11,257 1,862 Updated Dec 12, 2024

Voice Conversion Based on Learnable Similarity-Guided Masked Autoencoder

Python 5 Updated Sep 30, 2022

Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)

Python 113 13 Updated Feb 7, 2024

To provide the stego community with C/C++ implementations of selected feature extractors mainly targeted at H.264 steganography.

81 12 Updated Jun 2, 2021

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

Python 7,897 1,897 Updated Sep 26, 2024

Package conda environments for redistribution

Python 530 94 Updated Dec 2, 2024

A Pytorch Toy Implementation of 'Dynamic Region-Aware Convolution (ECCV2020)'

Python 103 17 Updated May 15, 2021

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 136,287 27,294 Updated Dec 14, 2024

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Python 1,989 510 Updated Jul 27, 2024

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 35,495 5,215 Updated Nov 15, 2024

Calculation of MCD (dB) between two speech waveforms

Jupyter Notebook 57 14 Updated Sep 26, 2020

Any-to-any voice conversion using synthetic specific-speaker speeches as intermedium features

Python 54 4 Updated Oct 11, 2021

A fast Text-to-Speech (TTS) model. Work well for English, Mandarin/Chinese, Japanese, Korean, Russian and Tibetan (so far). 快速语音合成模型,适用于英语、普通话/中文、日语、韩语、俄语和藏语(当前已测试)。

Python 255 46 Updated Mar 25, 2023
Next