Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 8,397 640 Updated Jan 23, 2025

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 7,871 821 Updated Jan 24, 2025

THUDM / CodeGeeX2

CodeGeeX2: A More Powerful Multilingual Code Generation Model

Python 7,645 532 Updated Jul 10, 2024

netease-youdao / EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Python 7,616 650 Updated Aug 13, 2024

Morizeyao / GPT2-Chinese

Chinese version of GPT2 training code, using BERT tokenizer.

Python 7,505 1,706 Updated Apr 25, 2024

jaywalnut310 / vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 7,079 1,293 Updated Dec 6, 2023

wenet-e2e / wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Python 4,283 1,099 Updated Jan 10, 2025

TensorSpeech / TensorFlowTTS

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

Python 3,882 816 Updated Jul 5, 2024

facebookresearch / encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,579 313 Updated Jan 4, 2024

L1aoXingyu / pytorch-beginner

pytorch tutorial for beginners

Python 3,003 1,088 Updated Feb 12, 2022

ictnlp / LLaMA-Omni

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,780 187 Updated Nov 14, 2024

modelscope / ClearerVoice-Studio

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 2,138 151 Updated Jan 27, 2025

geatpy-dev / geatpy

Evolutionary algorithm toolbox and framework with high performance for Python

Python 2,048 728 Updated Jan 17, 2025

yxlllc / DDSP-SVC

Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)

Python 1,983 252 Updated Jan 13, 2025

CheshireCC / faster-whisper-GUI

faster_whisper GUI with PySide6

Python 1,967 117 Updated Dec 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

francklinson

Block or report francklinson

Stars

huggingface / transformers

openai / whisper

RVC-Boss / GPT-SoVITS

google-research / bert

coqui-ai / TTS

facebookresearch / fairseq

Anjok07 / ultimatevocalremovergui

fishaudio / fish-speech

SYSTRAN / faster-whisper

ShangtongZhang / reinforcement-learning-an-introduction

THUDM / ChatGLM3

m-bain / whisperX

speechbrain / speechbrain

MorvanZhou / Reinforcement-learning-with-tensorflow

espnet / espnet

open-mmlab / Amphion