Highlights
- Pro
Stars
The official implementation of HierSpeech++
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)
Object-oriented handling of audio data, with GPU-powered augmentations, and more.
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
🤖 Build voice-based LLM agents. Modular + open source.
HTTP(S) benchmark tools, testing/debugging, & restAPI (RESTful)
A high-performance HTTP benchmarking tool that includes a real-time web UI and terminal display
Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.
Performant and accurate speech recognition built on Pytorch
macOS command line utility to configure multi-display resolutions and arrangements. Essentially XRandR for macOS.
resemble-ai / normalise
Forked from EFord36/normaliseA module for normalising text.
Rich is a Python library for rich text and beautiful formatting in the terminal.
A playbook for systematically maximizing the performance of deep learning models.
Avocodo: Generative Adversarial Network for Artifact-free Vocoder
Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)
Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm
Non-distributional linguistic word vector representations.
Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
NBA Stats API via Basketball Reference
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
the AI-native open-source embedding database
A desktop application for viewing and analyzing tabular data
Fast supervised sentence boundary detection using the averaged perceptron