sciai-ai

sciai-ai

7 followers · 163 following

Stars

nomonosound / fast-align-audio

A fast python library for aligning similar audio snippets passed in as NumPy arrays

Python 43 2 Updated Aug 15, 2024

xinjli / allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

Python 590 87 Updated Apr 26, 2024

facebookresearch / seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,256 1,105 Updated Nov 14, 2024

Aria-K-Alethia / laughter-synthesis

Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" accepted by INTERSPEECH 2023.

Python 73 5 Updated Jul 16, 2023

babua / audiotools

Forked from descriptinc/audiotools

Object-oriented handling of audio data, with GPU-powered augmentations, and more.

Python 1 Updated Jul 21, 2023

mallorbc / lit-gpt

Forked from Lightning-AI/litgpt

Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.…

Python 1 Updated Jul 12, 2023

guidance-ai / guidance

A guidance language for controlling large language models.

Jupyter Notebook 19,545 1,065 Updated Jan 29, 2025

0nutation / SpeechGPT

SpeechGPT Series: Speech Large Language Models

Python 1,331 89 Updated Jul 22, 2024

ray-project / llm-numbers

Numbers every LLM developer should know

4,156 141 Updated Jan 16, 2024

lucidrains / naturalspeech2-pytorch

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Python 1,305 104 Updated Sep 24, 2023

Wordcab / wordcab-transcribe

💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.

Python 203 29 Updated Oct 30, 2024

AUGMXNT / llm-experiments

Experiments w/ ChatGPT, LangChain, local LLMs

Python 24 1 Updated Jun 5, 2023

dioco-group / jenny-tts-dataset

A high-quality, varied ~30hr voice dataset suitable for training a TTS model

58 3 Updated Jan 7, 2023

IntelligenzaArtificiale / Free-Auto-GPT

Free Auto GPT with NO paids API is a repository that offers a simple version of Auto GPT, an autonomous AI agent capable of performing tasks independently. Unlike other versions, our implementation…

Python 2,497 392 Updated Jun 19, 2024

RVC-Project / Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Python 26,510 3,805 Updated Nov 24, 2024

suno-ai / bark

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 36,783 4,331 Updated Aug 19, 2024

triton-inference-server / pytriton

PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.

Python 765 53 Updated Nov 19, 2024

ttengwang / Caption-Anything

Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/sp…

Python 1,705 105 Updated Aug 29, 2023

ryanrudes / YTTTS

The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions

Python 52 2 Updated Apr 1, 2021

OP Vault ChatGPT: Give ChatGPT long-term memory using the OP Stack (OpenAI + Pinecone Vector Database). Upload your own custom knowledge base files (PDF, txt, epub, etc) using a simple React frontend.

JavaScript 3,310 307 Updated Jan 26, 2025

openai / tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 13,179 923 Updated Oct 3, 2024

xinjli / alqalign

multilingual speech aligner

Python 73 5 Updated Nov 19, 2023

meta-llama / llama

Inference code for Llama models

Python 57,427 9,688 Updated Jan 26, 2025

EtienneAb3d / WhisperHallu

Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts

Python 298 21 Updated Nov 12, 2024

bshall / ZeroSpeech

VQ-VAE for Acoustic Unit Discovery and Voice Conversion

Python 333 45 Updated Jul 6, 2023

chomeyama / DualCycleGAN

Official implementation of DualCycleGAN for nonparallel audio super resolution

Python 51 5 Updated Nov 1, 2022

faroit / python_audio_loading_benchmark

Benchmark popular audio i/o packages

Python 139 10 Updated Dec 19, 2023

lucidrains / lion-pytorch

🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch

Python 2,091 53 Updated Nov 27, 2024

Martinsos / edlib

Lightweight, super fast C/C++ (& Python) library for sequence alignment using edit (Levenshtein) distance.

C++ 526 167 Updated Sep 4, 2024

etzinis / unsup_speech_enh_adaptation

Unsupervised domain adaptation for conversational speech enhancement using RemixIT

Jupyter Notebook 53 5 Updated Apr 25, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly