Stars
A guidance language for controlling large language models.
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
This repo is related to the Art-Palette experiment from Google Arts & Culture.
File parser optimised for LLM ingestion with no loss 🧠 Parses PDFs, DOCX, and PPTX into a format that is ideal for LLMs.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Official inference repo for FLUX.1 models
Entropy Based Sampling and Parallel CoT Decoding
A C++ implementation of the fast voxel traversal algorithm.
Distributed data engine for Python/SQL designed for the cloud, powered by Rust
Realtime application framework (Node.js server)
sensible.vim: Defaults everyone can agree on
Port of OpenAI's Whisper model in C/C++
This is the git repository for sharing tutorial slides of CS3210.
Official implementation of Phi-Mamba. A MOHAWK-distilled model (Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Models)
Final Year Project - investigation of deep learning models for Hong Kong Mahjong
GNU Bison and GNU Flex C++ example
llama3.np is a pure NumPy implementation of the Llama 3 model.
Fast & memory efficient hashtable based on robin hood hashing for C++11/14/17/20
Mirage: Automatically Generating Fast GPU Kernels without Programming in Triton/CUDA