Stars
Realtime voice assistant powered by Groq's whisper API, Groq's Llama and ElevenLabs text-to-speech
LLM-PowerHouse: Unleash LLMs' potential through curated tutorials, best practices, and ready-to-use code for custom training and inferencing.
A simple screen parsing tool towards pure vision based GUI agent
Example code for using the Instructor python library with Groq and OpenAI LLMs
GPT-3 chatbot with long-term memory and external sources
A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages
Benchmarking the serving capabilities of vLLM
AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports LLMs, embeddings, and speech-to-text.
A repository that provides a thorough collection of approaches and methods used for evaluating Large Language Models (LLMs).
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
The Big list of the github, open-source compilers.
The most advanced AI retrieval system. Containerized, Retrieval-Augmented Generation (RAG) with a RESTful API.
OCR, layout analysis, reading order, table recognition in 90+ languages
Convert PDF to markdown + JSON quickly with high accuracy
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
High-performance retrieval engine for unstructured data
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.
Developer APIs to Accelerate LLM Projects
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Enforce the output format (JSON Schema, Regex etc) of a language model
InstructLab Command-Line Interface. Use this to chat with a model and execute the InstructLab workflow to train a model using custom taxonomy data.