
A fast Python library for aligning similar audio snippets passed in as NumPy arrays

Python 43 2 Updated Aug 15, 2024

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

Python 590 87 Updated Apr 26, 2024

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,256 1,105 Updated Nov 14, 2024

Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" accepted by INTERSPEECH 2023.

Python 73 5 Updated Jul 16, 2023

Object-oriented handling of audio data, with GPU-powered augmentations, and more.

Python 1 Updated Jul 21, 2023

Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.…

Python 1 Updated Jul 12, 2023

A guidance language for controlling large language models.

Jupyter Notebook 19,545 1,065 Updated Jan 29, 2025

SpeechGPT Series: Speech Large Language Models

Python 1,331 89 Updated Jul 22, 2024

Numbers every LLM developer should know

4,156 141 Updated Jan 16, 2024

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Python 1,305 104 Updated Sep 24, 2023

💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.

Python 203 29 Updated Oct 30, 2024

Experiments w/ ChatGPT, LangChain, local LLMs

Python 24 1 Updated Jun 5, 2023

A high-quality, varied ~30hr voice dataset suitable for training a TTS model

58 3 Updated Jan 7, 2023

Free Auto GPT with NO paid API is a repository that offers a simple version of Auto GPT, an autonomous AI agent capable of performing tasks independently. Unlike other versions, our implementation…

Python 2,497 392 Updated Jun 19, 2024

Easily train a good VC model with voice data <= 10 mins!

Python 26,510 3,805 Updated Nov 24, 2024

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 36,783 4,331 Updated Aug 19, 2024

PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.

Python 765 53 Updated Nov 19, 2024

Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/sp…

Python 1,705 105 Updated Aug 29, 2023

The YouTube Text-To-Speech dataset comprises waveform audio extracted from YouTube videos alongside their English transcriptions

Python 52 2 Updated Apr 1, 2021

OP Vault ChatGPT: Give ChatGPT long-term memory using the OP Stack (OpenAI + Pinecone Vector Database). Upload your own custom knowledge base files (PDF, txt, epub, etc) using a simple React frontend.

JavaScript 3,310 307 Updated Jan 26, 2025

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 13,179 923 Updated Oct 3, 2024

multilingual speech aligner

Python 73 5 Updated Nov 19, 2023

Inference code for Llama models

Python 57,427 9,688 Updated Jan 26, 2025

Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts

Python 298 21 Updated Nov 12, 2024

VQ-VAE for Acoustic Unit Discovery and Voice Conversion

Python 333 45 Updated Jul 6, 2023

Official implementation of DualCycleGAN for nonparallel audio super resolution

Python 51 5 Updated Nov 1, 2022

Benchmark popular audio i/o packages

Python 139 10 Updated Dec 19, 2023

🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch

Python 2,091 53 Updated Nov 27, 2024

Lightweight, super fast C/C++ (& Python) library for sequence alignment using edit (Levenshtein) distance.

C++ 526 167 Updated Sep 4, 2024

Unsupervised domain adaptation for conversational speech enhancement using RemixIT

Jupyter Notebook 53 5 Updated Apr 25, 2023