Skip to content
View andysalerno's full-sized avatar

Block or report andysalerno

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 117 13 Updated Apr 22, 2024

Faster structured generation

Rust 181 26 Updated Feb 24, 2025

nsync is a C library that exports various synchronization primitives, such as mutexes

C 1,122 86 Updated Jul 23, 2024

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 1,008 87 Updated Feb 24, 2025

Blazingly fast LLM inference.

Rust 5,085 357 Updated Feb 24, 2025

Effortlessly run LLM backends, APIs, frontends, and services with one command.

Python 1,303 98 Updated Feb 23, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 11,423 732 Updated Feb 24, 2025

A Beautiful Private and Secure Desktop Investment Tracking Application

TypeScript 4,947 263 Updated Feb 9, 2025

A standalone version of the readability lib

JavaScript 9,445 627 Updated Feb 14, 2025

A web crawler and scraper for Rust

Rust 1,506 125 Updated Feb 24, 2025

A fast inference library for running LLMs locally on modern consumer-class GPUs

Python 3,988 304 Updated Feb 13, 2025

πŸ€— Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 140,081 28,084 Updated Feb 24, 2025

Local AI API Platform

C++ 2,496 151 Updated Feb 24, 2025

✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows

TypeScript 81,234 61,021 Updated Feb 24, 2025

Enforce the output format (JSON Schema, Regex etc) of a language model

Python 1,710 75 Updated Feb 15, 2025

AICI: Prompts as (Wasm) Programs

Rust 2,001 82 Updated Jan 22, 2025

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 5,670 501 Updated Feb 24, 2025

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 78,320 9,301 Updated Feb 24, 2025

A simple, Git-powered wiki with a local frontend and support for many kinds of markup and content.

Ruby 13,946 1,563 Updated Jan 17, 2025

CMS/Wiki system using Javascript for 100% client side single page application using Markdown.

JavaScript 3,143 778 Updated Feb 2, 2025

⏩ Continue is the leading open-source AI code assistant. You can connect any models and any context to build custom autocomplete and chat experiences inside VS Code and JetBrains

TypeScript 23,756 2,264 Updated Feb 24, 2025

LLM training in simple, raw C/CUDA

Cuda 25,772 2,955 Updated Oct 2, 2024

Inference Llama 2 in one file of pure C

C 18,071 2,200 Updated Aug 6, 2024

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 14,176 1,164 Updated May 23, 2024

Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.

146 3 Updated May 16, 2024

Tile primitives for speedy kernels

Cuda 2,072 117 Updated Feb 22, 2025

πŸ™Œ OpenHands: Code Less, Make More

Python 47,639 5,243 Updated Feb 24, 2025

Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

Python 17,975 2,199 Updated Feb 23, 2025

Enhanced ChatGPT Clone: Features Agents, DeepSeek, Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message se…

TypeScript 22,382 3,736 Updated Feb 24, 2025

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

Python 9,344 732 Updated Aug 5, 2024
Next