-
University of West Bohemia
- Pilsen, Czech Republic
Highlights
- Pro
Lists (3)
Sort Name ascending (A-Z)
Stars
An AI-powered data science team of agents to help you perform common data science tasks 10X faster.
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch
pix2tex: Using a ViT to convert images of equations into LaTeX code.
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.
Simple text to phones converter for multiple languages
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)
Multilingual G2P in 100 languages
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions
Experimental ground for optimizing memory of pytorch models
Confidence interval computation for evaluation in machine learning using the bootstrapping approach
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of advanced features, such as a settings page, low VRAM support, D…
MARS5 speech model (TTS) from CAMB.AI
Guided course to crash into the most basic ML algorithms.
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Foundational model for human-like, expressive TTS
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
'Grad-TTS' with Multilingual Cleaners
hcy71o / SC-VITS
Forked from jaywalnut310/vitsVITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods.
text to speech using autoregressive transformer and VITS
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
Unofficial SoundStream implementation of Pytorch with training code and 16kHz pretrained checkpoint