Stars
An OAI compatible exllamav2 API that's both lightweight and fast
This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive A…
We run Node.js with Ollama Hosting LLM locally and we use D-ID for Live Avatar
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with comma…
The easiest way to run WireGuard VPN + Web-based Admin UI.
This repository contains demos I made with the Transformers library by HuggingFace.
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
A Cookbook to start building with LLMs
A quick guide (especially) for trending instruction finetuning datasets
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
A framework for few-shot evaluation of language models.
This is our own implementation of 'Layer Selective Rank Reduction'
Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
Unofficial Bitwarden compatible server written in Rust, formerly known as bitwarden_rs
Speedtest Tracker is a self-hosted application that monitors the performance and uptime of your internet connection.
A highly customizable homepage (or startpage / application dashboard) with Docker and service API integrations.
Data extraction with LLM on CPU
Web UI for working with large language models
A Gradio web UI for Large Language Models with support for multiple inference backends.