Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 11,257 1,862 Updated Dec 12, 2024

BrightGu / MAE-VC

Voice Conversion Based on Learnable Similarity-Guided Masked Autoencoder

Python 5 Updated Sep 30, 2022

YoungSeng / SRD-VC

Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)

Python 113 13 Updated Feb 7, 2024

zhanghong863 / Feature-Extractors-for-Video-Steganalysis

To provide the stego community with C/C++ implementations of selected feature extractors mainly targeted at H.264 steganography.

81 12 Updated Jun 2, 2021

nl8590687 / ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

Python 7,897 1,897 Updated Sep 26, 2024

conda / conda-pack

Package conda environments for redistribution

Python 530 94 Updated Dec 2, 2024

shallowtoil / DRConv-PyTorch

A Pytorch Toy Implementation of 'Dynamic Region-Aware Convolution (ECCV2020)'

Python 103 17 Updated May 15, 2021

huggingface / transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 136,287 27,294 Updated Dec 14, 2024

jik876 / hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Python 1,989 510 Updated Jul 27, 2024

babysor / MockingBird

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 35,495 5,215 Updated Nov 15, 2024

SamuelBroughton / Mel-Cepstral-Distortion

Calculation of MCD (dB) between two speech waveforms

Jupyter Notebook 57 14 Updated Sep 26, 2020

BrightGu / MediumVC

Any-to-any voice conversion using synthetic specific-speaker speeches as intermedium features

Python 54 4 Updated Oct 11, 2021

atomicoo / FCH-TTS

A fast Text-to-Speech (TTS) model. Work well for English, Mandarin/Chinese, Japanese, Korean, Russian and Tibetan (so far). 快速语音合成模型，适用于英语、普通话/中文、日语、韩语、俄语和藏语（当前已测试）。

Python 255 46 Updated Mar 25, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

谷下雨 BrightGu

Achievements

Achievements

Block or report BrightGu

Lists (1)

🚀 My stack

Stars

openai / whisper

Anjok07 / ultimatevocalremovergui

elevenlabs / elevenlabs-python

fishaudio / Bert-VITS2

suno-ai / bark

bshall / knn-vc

RVC-Project / Retrieval-based-Voice-Conversion-WebUI

w-okada / voice-changer

chq1155 / A-Survey-on-Generative-Diffusion-Model

LPengYang / FreeDrag

coqui-ai / TTS

prophesier / diff-svc

jaywalnut310 / vits

xtekky / gpt4free

MoonInTheRiver / DiffSinger

svc-develop-team / so-vits-svc

andreas128 / RePaint

PaddlePaddle / PaddleSpeech