We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
SqueezeLLM
vLLMParameters
GPTQ
prefix
create_weights
gptq_marlin_24
qqq
marlin
vLLMParameter
awq
awq_marlin
gptq_marlin
PerTensorScaleParameter
BasevLLMParameter
weight_loader_v2
get_min_capability
w4a16
compressed-tensors
w8a16