Starred repositories
External repo for the EFAAR benchmarking paper
LlamaIndex is the leading framework for building LLM-powered agents over your data.
Discord server https://discord.gg/HrV52MgSC2 QQ频道 https://pd.qq.com/s/1dwwmkgq4
Real-time Fallacy Detection using OpenAI whisper and ChatGPT/LLaMA/Mistral
userspace daemon to combine joy-cons from the hid-nintendo kernel driver
8-bit CUDA functions for PyTorch Rocm compatible
List USB devices and reset a USB device from the command line
Repository to download, process, and visualize local climate data from ERA5
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
ENSO-ASC: ENSO deep learning forecast model with a multivariate air-sea coupler
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
A high-throughput and memory-efficient inference and serving engine for LLMs
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
A gradio web UI for running Large Language Models like GPT-J 6B, OPT, GALACTICA, LLaMA, and Pygmalion.
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
QLoRA: Efficient Finetuning of Quantized LLMs
An autonomous AI agent extension for Oobabooga's web ui
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Universal LLM Deployment Engine with ML Compilation