The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 48,268 5,706 Updated Sep 18, 2024

zjwang21 / mix-phoneme-bert

An unofficial PyTorch implementation of Mix-Phoneme-Bert

Python 39 7 Updated Jul 10, 2023

liusongxiang / Large-Audio-Models

Keep track of big models in audio domain, including speech, singing, music etc.

463 28 Updated Sep 26, 2024

anonymous-pits / pits

PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor

Python 277 34 Updated Jul 16, 2023

heatz123 / naturalspeech

A fully working pytorch implementation of NaturalSpeech (Tan et al., 2022)

Python 472 68 Updated Feb 7, 2024

haoheliu / AudioLDM

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Python 2,495 226 Updated Dec 9, 2024

AI-Unicamp / TTS-Objective-Metrics

Objective metrics used in several text-to-speech (TTS) papers.

Python 46 9 Updated Apr 22, 2022

xinjli / transphone

phoneme tokenizer and grapheme-to-phoneme model for 8k languages

Python 150 15 Updated Jun 9, 2023

sdercolin / vlabeler

Open source voice labeling application

Kotlin 153 22 Updated Nov 6, 2024

LAION-AI / Open-Assistant

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Python 37,128 3,248 Updated Aug 17, 2024

enhuiz / vall-e

An unofficial PyTorch implementation of the audio LM VALL-E

Python 2,974 419 Updated May 10, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tuong-olli

Block or report tuong-olli

Starred repositories

glory20h / VoiceLDM

hpcaitech / ColossalAI

magic-research / bubogpt

leejet / stable-diffusion.cpp

coqui-ai / TTS

gmltmd789 / UnitSpeech

svc-develop-team / so-vits-svc

facebookresearch / fairseq

linan2 / Voice-activity-detection-VAD-paper-and-code

anyvoiceai / Barkify

monglechap / fluenttts

roatienza / efficientspeech

langchain-ai / langchain

suno-ai / bark

modularml / mojo

maum-ai / phaseaug

serp-ai / bark-with-voice-clone

declare-lab / tango

lucidrains / naturalspeech2-pytorch

facebookresearch / segment-anything