Stars
DuckDB is an analytical in-process SQL database management system
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
Large-scale LLM inference engine
A crowdsourced distributed cluster for AI art and text generation
An OAI compatible exllamav2 API that's both lightweight and fast
LostRuins / koboldcpp
Forked from ggml-org/llama.cpp
Run GGUF models easily with a KoboldAI UI. One File. Zero Install.
LLM Frontend for Power Users.
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.
⬆️ GitHub Actions uptime monitor & status page by @AnandChowdhary
Wan: Open and Advanced Large-Scale Video Generative Models
Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)
DeepEP: an efficient expert-parallel communication library
The most intuitive desktop API client. Organize and execute REST, GraphQL, WebSockets, Server Sent Events, and gRPC 🦬
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
TheBoringNotch: Not so boring notch That Rocks 🎸🎶
The official Python SDK for Model Context Protocol servers and clients
Model Context Protocol Servers
FlashMLA: Efficient MLA Decoding Kernel for Hopper GPUs
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
Open Source Continuous File Synchronization
Muon optimizer: +>30% sample efficiency with <3% wallclock overhead
An app that brings language models directly to your phone.
Collection of Apple-native tools for the Model Context Protocol.
Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥