Stars
Samples for CUDA developers that demonstrate features in the CUDA Toolkit
A framework for few-shot evaluation of language models.
A unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deploym…
QLoRA: Efficient Finetuning of Quantized LLMs
A high-throughput and memory-efficient inference and serving engine for LLMs
Fast and memory-efficient exact attention
《Machine Learning Systems: Design and Implementation》- Chinese Version
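This project aims to share the technical principles and hands-on experience behind large models (large-model engineering and real-world application deployment)
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.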
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
Making large AI models cheaper, faster and more accessible
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra-lightweight OCR system, supports recognition of 80+ languages, provides data annotation and synthesis tools, supports training and…
Unofficial implementation of LSQ-Net, a neural network quantization framework
A list of papers, docs, and code about model quantization. This repo aims to provide information for model quantization research, and we are continuously improving the project. PRs adding works are welcome (p…
PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.
OpenMMLab Detection Toolbox and Benchmark
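The official PyTorch implementation of the ICLR 2022 paper, QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quantization
The open-source Python project "The Road to Self-Taught Programming": step-by-step tutorials covering AI lab, curated videos, data structures, study guides, hands-on machine learning, hands-on deep learning, web scraping, big-tech interview experiences, programmer life, and resource sharing.
A simple toolkit for detecting and cropping the main body from pictures. Supports face and saliency detection.
PyTorch code for Hybrid Coarse-fine Classification for Head Pose Estimation
ICCV 2021/2019/2017 collection of papers, code, paper interpretations, and livestreams, compiled by the 极市 team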
Open-source Windows and Office activator featuring HWID, Ohook, TSforge, KMS38, and Online KMS activation methods, along with advanced troubleshooting.
Model pruning (network slimming) of YOLOv3 on the Oxford Hand dataset