Stars
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
React app for inspecting, building and debugging with the Realtime API
This is a Google Apps Script library for Gemini API with files.
A project that gives ChatGPT Vision spatial awareness and the ability to give accurate screen coordinates.
Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation
HAAS = Hierarchical Autonomous Agent Swarm - "Resistance is futile!"
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild
An open source payments switch written in Rust to make payments fast, reliable and affordable
Agent Cloud is like having your own GPT builder with a bunch extra goodies. The GUI features 1) RAG pipeline which can natively embed 260+ datasources 2) Create Conversational apps (like GPTs) 3) Cβ¦
Letta (formerly MemGPT) is a framework for creating LLM services with memory.
ππ§ π¬ Supercharged Custom Instructions for ChatGPT (non-coding) and ChatGPT Advanced Data Analysis (coding).
Expert AI code reviews Catch bugs before they hurt Run Scanline in your CLI to find: - race conditions - logical errors - security risks - optimizations
We write your reusable computer vision tools. π
Examples and guides for using the OpenAI API
πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Stable Diffusion web UI
A complete computer science study plan to become a software engineer.
A natural language interface for computers
An example usage of object detection on realtime screen stream, and show detection with OS overlay notifications.