jmaty

Jindrich Matousek jmaty

1 follower · 0 following

University of West Bohemia
Pilsen, Czech Republic

Achievements

Highlights

Lists (3)

Sort

Stars

business-science / ai-data-science-team

An AI-powered data science team of agents to help you perform common data science tasks 10X faster.

Python 1,456 263 Updated Feb 25, 2025

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 11,481 1,136 Updated Mar 1, 2025

aishwaryanr / awesome-generative-ai-guide

A one stop repository for generative AI research updates, interview resources, notebooks and much more!

11,176 2,342 Updated Mar 4, 2025

lucidrains / e2-tts-pytorch

Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch

Python 442 42 Updated Feb 12, 2025

lukas-blecher / LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Python 13,669 1,090 Updated Jan 18, 2025

cleanlab / cleanlab

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Python 10,214 799 Updated Mar 4, 2025

SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 10,063 1,373 Updated Feb 24, 2025

IIEleven11 / Automatic-Audio-Dataset-Maker

Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.

Python 26 Updated Mar 5, 2025

bootphon / phonemizer

Simple text to phones converter for multiple languages

Python 1,343 183 Updated Sep 26, 2024

HuuHuy227 / XphoneBert_Vits2

VITS2 extended with XPhoneBERT encoder

Python 8 3 Updated Oct 19, 2024

VinAIResearch / XPhoneBERT

XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)

Python 316 39 Updated Jul 22, 2024

lingjzhu / CharsiuG2P

Multilingual G2P in 100 languages

Jupyter Notebook 300 24 Updated May 26, 2023

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 8,623 670 Updated Mar 3, 2025

yl4579 / PL-BERT

Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions

Python 240 47 Updated Jan 13, 2025

IIEleven11 / StyleTTS2FineTune

Python 205 34 Updated Oct 2, 2024

prigoyal / pytorch_memonger

Experimental ground for optimizing memory of pytorch models

Python 365 35 Updated Apr 23, 2018

luferrer / ConfidenceIntervals

Confidence interval computation for evaluation in machine learning using the bootstrapping approach

Jupyter Notebook 77 9 Updated Apr 5, 2024

erew123 / alltalk_tts

AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of advanced features, such as a settings page, low VRAM support, D…

HTML 1,559 167 Updated Feb 27, 2025