Highlights
- Pro
Stars
A Blazing Fast AI Gateway with integrated Guardrails. Route to 200+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.
A python application that routes incoming prompts to an LLM by category, and can support a single incoming connection from a front end to many backend connections to LLMs, allowing one AI Assistant…
Let your Claude able to think
LLM Finetuning with peft
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
Tensors and dynamic neural networks in pure Rust.
“连续八年成为全世界最受喜爱的语言,无 GC 也无需手动内存管理、极高的性能和安全性、过程/OO/函数式编程、优秀的包管理、JS 未来基石" — 工作之余的第二语言来试试 Rust 吧。本书拥有全面且深入的讲解、生动贴切的示例、德芙般丝滑的内容,这可能是目前最用心的 Rust 中文学习教程 / Book
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
LOTUS: A semantic query engine - process data with LLMs as easily as writing pandas code
Empowering RAG with a memory-based data interface for all-purpose applications!
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
Lightning-fast serving engine for any AI model of any size. Flexible. Easy. Enterprise-scale.
A simple, easy-to-hack GraphRAG implementation
🤖 𝗟𝗲𝗮𝗿𝗻 for 𝗳𝗿𝗲𝗲 how to 𝗯𝘂𝗶𝗹𝗱 an end-to-end 𝗽𝗿𝗼𝗱𝘂𝗰𝘁𝗶𝗼𝗻-𝗿𝗲𝗮𝗱𝘆 𝗟𝗟𝗠 & 𝗥𝗔𝗚 𝘀𝘆𝘀𝘁𝗲𝗺 using 𝗟𝗟𝗠𝗢𝗽𝘀 best practices: ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤𝘰𝘥𝘦 + 12 𝘩𝘢𝘯𝘥𝘴-𝘰𝘯 𝘭𝘦𝘴𝘴𝘰𝘯𝘴
Following emerging Large Language Model Operations (LLM Ops) best practices in the industry, you’ll learn all about the key technologies that enable Generative AI practitioners like you to leverage…
An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data…
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理