Lists (1)
Sort Name ascending (A-Z)
Stars
A high-throughput and memory-efficient inference and serving engine for LLMs
Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
Chatbot for documentation, that allows you to chat with your data. Privately deployable, provides AI knowledge sharing and integrates knowledge into your AI workflow
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
Unified framework for building enterprise RAG pipelines with small, specialized models
A terminal spreadsheet multitool for discovering and arranging data
Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web, vision.
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
A lightweight REST framework that wraps the Apprise Notification Library
Make Discord your LLM frontend ● Supports any OpenAI compatible API (Ollama, LM Studio, vLLM, OpenRouter, xAI, Mistral, Groq and more)
Create and manage data pipes with Meerschaum.