Stars
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
🔊 Text-Prompted Generative Audio Model
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
Foundational Models for State-of-the-Art Speech and Text Translation
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
A self-organizing file system with llama 3
serp-ai / bark-with-voice-clone
Forked from suno-ai/bark
🔊 Text-prompted Generative Audio Model - With the ability to clone voices
MARS5 speech model (TTS) from CAMB.AI
[ICCV 2023] DDColor: Towards Photo-Realistic Image Colorization via Dual Decoders