Stars
📚150+ Tensor/CUDA Cores Kernels, ⚡️flash-attention-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS 🎉🎉).
Grandmaster-Level Chess Without Search
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting…
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
Improve your Baduk skills by training with KataGo!
Collection of publicly available IPTV channels from all over the world
Multiple NVIDIA GPUs or Apple Silicon for Large Language Model Inference?
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
What would you do with 1000 H100s...
Solve puzzles. Improve your PyTorch.
Python wrapper for Xvfb, Xephyr and Xvnc
Modeling, training, eval, and inference code for OLMo
(Crafter + NetHack) in JAX. ICML 2024 Spotlight.
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
Android in Docker solution with noVNC support and video recording
Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.