LLM Inference
5 repositories
The simplest way to serve AI/ML models in production
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.
Enforce the output format (JSON Schema, regex, etc.) of a language model
Build Multimodal AI Agents with memory, knowledge and tools. Simple, fast and model-agnostic.