- University of Southampton
- China
- https://anyms-a.github.io/
Stars
Python package built to ease deep learning on graph, on top of existing DL frameworks.
Robyn is a Super Fast Async Python Web Framework with a Rust runtime.
A PyTorch implementation of Dynamic Graph CNN for Learning on Point Clouds (DGCNN)
Kernel Point Convolution implemented in PyTorch
How to optimize algorithms in CUDA.
🦀 Small exercises to get you used to reading and writing Rust code!
PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
A self-learning tutorial for CUDA high-performance programming.
Gaussian Haircut: Human Hair Reconstruction with Strand-Aligned 3D Gaussians
Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton
Burn is a new comprehensive dynamic Deep Learning Framework built using Rust with extreme flexibility, compute efficiency and portability as its primary goals.
SGLang is a fast serving framework for large language models and vision language models.
A Collection of BM25 Algorithms in Python
🎮 A step-by-step guide to implementing SSAO, depth of field, lighting, normal mapping, and more for your 3D game.
A simple, easy-to-hack GraphRAG implementation
A Bulletproof Way to Generate Structured JSON from Language Models
DSPy: The framework for programming—not prompting—language models
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently.
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
Example models using DeepSpeed
An extremely fast Python package and project manager, written in Rust.
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory