Skip to content
View jmaty's full-sized avatar
  • University of West Bohemia
  • Pilsen, Czech Republic

Highlights

  • Pro

Block or report jmaty

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An AI-powered data science team of agents to help you perform common data science tasks 10X faster.

Python 1,456 263 Updated Feb 25, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 11,481 1,136 Updated Mar 1, 2025

A one stop repository for generative AI research updates, interview resources, notebooks and much more!

11,176 2,342 Updated Mar 4, 2025

Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch

Python 442 42 Updated Feb 12, 2025

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Python 13,669 1,090 Updated Jan 18, 2025

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Python 10,214 799 Updated Mar 4, 2025

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 10,063 1,373 Updated Feb 24, 2025

Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.

Python 26 Updated Mar 5, 2025

Simple text to phones converter for multiple languages

Python 1,343 183 Updated Sep 26, 2024

VITS2 extended with XPhoneBERT encoder

Python 8 3 Updated Oct 19, 2024

XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)

Python 316 39 Updated Jul 22, 2024

Multilingual G2P in 100 languages

Jupyter Notebook 300 24 Updated May 26, 2023

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 8,623 670 Updated Mar 3, 2025

Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions

Python 240 47 Updated Jan 13, 2025

Experimental ground for optimizing memory of pytorch models

Python 365 35 Updated Apr 23, 2018

Confidence interval computation for evaluation in machine learning using the bootstrapping approach

Jupyter Notebook 77 9 Updated Apr 5, 2024

AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of advanced features, such as a settings page, low VRAM support, D…

HTML 1,559 167 Updated Feb 27, 2025

MARS5 speech model (TTS) from CAMB.AI

Jupyter Notebook 2,628 216 Updated Aug 1, 2024

Guided course to crash into the most basic ML algorithms.

Jupyter Notebook 53 9 Updated Apr 2, 2024

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Python 2,059 523 Updated Jul 27, 2024

Foundational model for human-like, expressive TTS

Python 4,053 676 Updated Jul 30, 2024

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Jupyter Notebook 573 125 Updated Sep 18, 2023

'Grad-TTS' with Multilingual Cleaners

Jupyter Notebook 10 1 Updated Apr 6, 2024

VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods.

Python 36 2 Updated Sep 21, 2022

text to speech using autoregressive transformer and VITS

Python 235 17 Updated Apr 3, 2024

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Python 2,507 275 Updated Jan 12, 2025

Unofficial SoundStream implementation of Pytorch with training code and 16kHz pretrained checkpoint

Python 66 10 Updated Jun 25, 2023

Unoffical implementation of Megatts2

Python 277 36 Updated Mar 23, 2024

The Open Source Code of UniAudio

Python 546 32 Updated Jul 22, 2024
Next