Stars
⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.
Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
thinhlpg / TTS
Forked from coqui-ai/TTS🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.
🐸 - A general purpose model trainer, as flexible as it gets
A python package to analyze and compare voices with deep learning
OpenChat: Advancing Open-source Language Models with Imperfect Data
A high-throughput and memory-efficient inference and serving engine for LLMs
State-of-the-art 2D and 3D Face Analysis Project
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
Official repository for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation
152334H / DL-Art-School
Forked from neonbjb/DL-Art-SchoolTorToiSe fine-tuning with DLAS
Simple example of FastAPI + gRPC AsyncIO + Triton
Official inference library for Mistral models
Easily train a good VC model with voice data <= 10 mins!
FastAPI Best Practices and Conventions we used at our startup
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Full stack, modern web application template. Using FastAPI, React, SQLModel, PostgreSQL, Docker, GitHub Actions, automatic HTTPS and more.
SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs
A dynamic FastAPI router that automatically creates CRUD routes for your models
This code generator creates FastAPI app from an openapi file.
Fast, beautiful and extensible administrative interface framework for Starlette & FastApi applications