Lists (1)
Sort Name ascending (A-Z)
Stars
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch
π€ π¬ Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
ML-powered speech recognition directly in your browser
A collection of (mostly) technical things every software developer should know about
PraisonAI is an AI Agents Framework with Self Reflection. PraisonAI application combines PraisonAI Agents, AutoGen, and CrewAI into a low-code solution for building and managing multi-agent LLM sysβ¦
ππ€ Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
LSP-AI is an open-source language server that serves as a backend for AI-powered functionality, designed to assist and empower software engineers, not replace them.
Too Long, Didn't Watch: End-to-End Rolling Summarizer of Long Videos
High-performance In-browser LLM Inference Engine
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
A generative speech model for daily dialogue.
aider is AI pair programming in your terminal
The most no-nonsense, locally or API-hosted AI code completion plugin for Visual Studio Code - like GitHub Copilot but completely free and 100% private.
LlamaIndex is a data framework for your LLM applications
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Presidential bot built on top of Llama3-8B fine-tune over +100 hours of video interviews
β© Continue is the leading open-source AI code assistant. You can connect any models and any context to build custom autocomplete and chat experiences inside VS Code and JetBrains
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The β¦
Easy Docker setup for Stable Diffusion with user-friendly UI
πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
π Text-Prompted Generative Audio Model
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of β¦
the AI-native open-source embedding database
Drag & drop UI to build your customized LLM flow
π₯ Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.