Commit

Update mistralrs-quant readme
EricLBuehler committed Sep 7, 2024
1 parent 1cec176 commit cccdd27
Showing 1 changed file with 3 additions and 0 deletions.
3 changes: 3 additions & 0 deletions mistralrs-quant/README.md
@@ -5,6 +5,9 @@ Quantization techniques for mistral.rs. This implements a common trait for all q
Currently supported:
- GGUF: `GgufMatMul`
- Gptq: `GptqLayer`
- Hqq: `HqqLayer`
- Unquantized (used for ISQ): `UnquantLinear`

Some kernels are copied or based on implementations in:
- https://github.com/vllm-project/vllm
- https://github.com/mobiusml/hqq
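
To illustrate what "a common trait" over several quantization backends can look like, here is a minimal, self-contained Rust sketch. Every name in it (`QuantLinear`, `UnquantSketch`, `Int8Sketch`) is hypothetical and chosen for illustration; this is not the actual mistralrs-quant trait, nor the API of `GgufMatMul`, `GptqLayer`, `HqqLayer`, or `UnquantLinear`. The int8 absmax scheme is a stand-in technique, not GGUF, GPTQ, or HQQ.

```rust
/// Hypothetical trait for illustration only; NOT the mistralrs-quant API.
trait QuantLinear {
    fn name(&self) -> &'static str;
    /// Computes y = W x, where W is row-major with `in_features` columns.
    fn forward(&self, x: &[f32]) -> Vec<f32>;
}

/// Unquantized layer: weights kept as f32 (hypothetical stand-in).
struct UnquantSketch {
    weight: Vec<f32>,
    in_features: usize,
}

impl QuantLinear for UnquantSketch {
    fn name(&self) -> &'static str {
        "unquantized"
    }

    fn forward(&self, x: &[f32]) -> Vec<f32> {
        self.weight
            .chunks(self.in_features)
            .map(|row| row.iter().zip(x).map(|(w, xi)| w * xi).sum::<f32>())
            .collect()
    }
}

/// Per-tensor absmax int8 quantization (an assumed, simplified scheme).
struct Int8Sketch {
    weight: Vec<i8>,
    scale: f32,
    in_features: usize,
}

impl Int8Sketch {
    fn quantize(weight: &[f32], in_features: usize) -> Self {
        let max_abs = weight.iter().fold(0.0f32, |m, w| m.max(w.abs()));
        // Avoid dividing by zero for an all-zero tensor.
        let scale = if max_abs == 0.0 { 1.0 } else { max_abs / 127.0 };
        let q = weight.iter().map(|w| (w / scale).round() as i8).collect();
        Self { weight: q, scale, in_features }
    }
}

impl QuantLinear for Int8Sketch {
    fn name(&self) -> &'static str {
        "int8-absmax"
    }

    fn forward(&self, x: &[f32]) -> Vec<f32> {
        // Dequantize each weight on the fly, then accumulate in f32.
        self.weight
            .chunks(self.in_features)
            .map(|row| {
                row.iter()
                    .zip(x)
                    .map(|(&q, xi)| q as f32 * self.scale * xi)
                    .sum::<f32>()
            })
            .collect()
    }
}

fn main() {
    let w = vec![1.0, -2.0, 0.5, 4.0];
    // Callers hold trait objects and need not know which backend is in use.
    let layers: Vec<Box<dyn QuantLinear>> = vec![
        Box::new(UnquantSketch { weight: w.clone(), in_features: 2 }),
        Box::new(Int8Sketch::quantize(&w, 2)),
    ];
    let x = [1.0, 1.0];
    for layer in &layers {
        println!("{}: {:?}", layer.name(), layer.forward(&x));
    }
}
```

The design point is that model code depends only on the trait, so a quantized layer and an unquantized one can be swapped without touching the caller.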
