Stars
LlamaIndex is a data framework for your LLM applications
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text
Making large AI models cheaper, faster and more accessible
ZenML 🙏: The bridge between ML and Ops. https://zenml.io.
An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models
ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling & Serving in one MLOps/LLMOps solution
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activelo…
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable,…
TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU su…
Open Source Platform for developing, scaling and deploying serious ML, AI, and data science systems
Machine Learning Toolkit for Kubernetes
Open source platform for the machine learning lifecycle
Distributed ML Training and Fine-Tuning on Kubernetes
Open Source, Google Zanzibar-inspired database for scalably storing and querying fine-grained authorization data
Google Chrome and Firefox extension that prevents the blocking of pasting into input fields
An open-source cross-platform alternative to AirDrop
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with comma…
Umami is a simple, fast, privacy-focused alternative to Google Analytics.
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
A Python library to extract tabular data from PDFs
Highly available elephant herd: HA PostgreSQL cluster using Docker
Enhanced ChatGPT Clone: Features Agents, Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, Code…