Stars
Hunt down social media accounts by username across social networks
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
๐ OpenHands: Code Less, Make More
aider is AI pair programming in your terminal
Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
A UI-Focused Agent for Windows OS Interaction.
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
Build real-time multimodal AI applications ๐ค๐๏ธ๐น
The only reliable agent framework built on top of the latest OpenAI Assistants API.
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
Convert any PDF into a podcast episode!
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data
Extract clean data from anywhere, powered by vision-language models โก
An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.
Information Assistant, built with Azure OpenAI Service, Industry Accelerator
A python application that routes incoming prompts to an LLM by category, and can support a single incoming connection from a front end to many backend connections to LLMs, allowing one AI Assistantโฆ
A comprehensive repository of reasoning tasks for Medical LLMs (and beyond)