Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guid…

JavaScript 9,465 1,831 Updated Dec 11, 2024

JiehangXie / PaddleBoBo

基于飞桨开发的虚拟主播

Python 1,045 299 Updated Mar 12, 2023

OpenTalker / SadTalker

[CVPR 2023] SadTalker：Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Python 12,096 2,249 Updated Jun 26, 2024

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 36,927 4,205 Updated Nov 7, 2024

Edresson / YourTTS

YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

Jupyter Notebook 919 81 Updated Nov 4, 2024

philiptzou / advanced-langconv

A Python based language converter

Python 8 2 Updated May 25, 2011

Rudrabha / Wav2Lip

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Python 10,979 2,329 Updated Nov 26, 2024

PaddlePaddle / PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 11,259 1,861 Updated Dec 12, 2024

kaldi-asr / kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Shell 14,362 5,326 Updated Nov 29, 2024

open-speech / speech-aligner

speech-aligner，是一个从“人声语音”及其“语言文本”，产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription

C++ 396 105 Updated Apr 8, 2020

elevenlabs / elevenlabs-python

The official Python API for ElevenLabs Text to Speech.

Python 2,262 264 Updated Dec 15, 2024

Revocalize / revocalize-python

The official Python API for Revocalize AI voice synthesizer platform.

Python 8 2 Updated Sep 11, 2023

netease-youdao / EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Python 7,520 639 Updated Aug 13, 2024

PlayVoice / lora-svc

singing voice change based on whisper, and lora for singing voice clone

Python 631 78 Updated Nov 3, 2023

Plachtaa / VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/

Python 7,719 767 Updated Feb 11, 2024

PlayVoice / whisper-vits-svc

Core Engine of Singing Voice Conversion & Singing Voice Clone

Python 2,697 924 Updated Apr 23, 2024

TensorSpeech / TensorFlowTTS

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

Python 3,851 816 Updated Jul 5, 2024

Executedone / Chinese-FastSpeech2

基于标贝数据继续训练，同时对原本的FastSpeech2模型做了改进，引入了韵律表征以及韵律预测模块，使中文发音更生动且富有节奏

Python 250 42 Updated Sep 10, 2023

babysor / MockingBird

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 35,498 5,215 Updated Nov 15, 2024

chatpire / chatgpt-web-share

ChatGPT Plus 共享方案。ChatGPT Plus / OpenAI API sharing solution.

Vue 4,316 679 Updated Nov 1, 2024

C0untFloyd / bark-gui

Forked from suno-ai/bark

🔊 Text-Prompted Generative Audio Model with Gradio

Python 680 64 Updated Nov 23, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

jacksonEE

Block or report jacksonEE

Stars

okxapi / okx-sample-market-maker

okxapi / python-okx

fudan-generative-vision / hallo

RVC-Project / Retrieval-based-Voice-Conversion-WebUI

Zejun-Yang / AniPortrait

HumanAIGC / EMO

facefusion / facefusion

lballabio / QuantLib

xszyou / fay-ue5

xszyou / Fay