Stars
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Platform to experiment with the AI Software Engineer. Terminal based. NOTE: Very different from https://gptengineer.app
🎨 Diagram as Code for prototyping cloud system architectures
LlamaIndex is the leading framework for building LLM-powered agents over your data.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
đź’« Industrial-strength Natural Language Processing (NLP) in Python
aider is AI pair programming in your terminal
Write scalable load tests in plain Python đźš—đź’¨
A cross-platform command-line utility that creates projects from cookiecutters (project templates), e.g. Python package projects, C projects.
DSPy: The framework for programming—not prompting—language models
Open source platform for the machine learning lifecycle
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Keep your application settings in sync (OS X/Linux)
Machine Learning Engineering Open Book
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
SGLang is a fast serving framework for large language models and vision language models.
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.
Always know what to expect from your data.
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
An open source multi-tool for exploring and publishing data
Perform data science on data that remains in someone else's server