wejoncy

wejoncy

Achievements

VPTQ, A Flexible and Extreme low-bit quantization algorithm

Python 598 42 Updated Mar 6, 2025

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 1,991 251 Updated Mar 6, 2025

A profiler to disclose and quantify hardware features on GPUs.

C++ 167 22 Updated May 15, 2022

A list of awesome compiler projects and papers for tensor computation and deep learning.

分享计算机视觉每天的arXiv文章

String splitting benchmarks

C++ 40 16 Updated May 29, 2016

Classical equations and diagrams in machine learning

TeX 7,569 1,275 Updated Jul 30, 2024