Tags: kustomzone/vllm
[Misc] Improve error message for incorrect pynvml (vllm-project#12809) Signed-off-by: youkaichao <[email protected]>
Disable chunked prefill and/or prefix caching when MLA is enabled (vllm-project#12642) From @mgoin in vllm-project#12638. I cannot push to that branch, therefore a new PR to unblock release. --------- Signed-off-by: mgoin <[email protected]> Signed-off-by: simon-mo <[email protected]> Co-authored-by: mgoin <[email protected]>
[Bugfix] Fix Granite 3.0 MoE model loading (vllm-project#12446) Signed-off-by: DarkLight1337 <[email protected]>
Deepseek v3 (vllm-project#11502) Signed-off-by: mgoin <[email protected]> Co-authored-by: mgoin <[email protected]> Co-authored-by: robertgshaw2-neuralmagic <[email protected]>
[BugFix] Fix quantization for all other methods (vllm-project#11547)
[Bugfix] Fix request cancellation without polling (vllm-project#11190)
[Build] skip renaming files for release wheels pipeline (vllm-project#9671) Signed-off-by: simon-mo <[email protected]>
[Misc] bump mistral common version (vllm-project#10367) Signed-off-by: simon-mo <[email protected]>
[CI/Build] remove .github from .dockerignore, add dirty repo check (vllm-project#9375)