Lists (1)
Sort Name ascending (A-Z)
Stars
🔊 Text-Prompted Generative Audio Model
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
This repository contains the source code for the paper First Order Motion Model for Image Animation
A multi-voice TTS system trained with an emphasis on quality
Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳Docker-friendly.⚡Always in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs,…
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
A Bulletproof Way to Generate Structured JSON from Language Models
[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.