Commits
Branch selector
User selector
Datepicker
Commit History
Commits on Feb 22, 2025
- authored
- authored
- authored
- authored
Correction to TP logic for Mamba Mixer 2 when Num Groups not divisible by TP Size (vllm-project#13660)
authored- authored
[Bugfix] Fix benchmark script bug: inaccurate stats for vllm backend when max_model_len < input_len + output_len (vllm-project#13691)
authored- authored
- authored
- authored
- authored
- authored
- authored
- authored
- authored
Commits on Feb 21, 2025
- authored
- authored
- authored
- authored
- authored
- authored
- authored
- authored
- authored
- authored
- authored
- authored
[Neuron][Kernel] Vectorize KV cache load in FlashPagedAttention to maximize DMA bandwidth (vllm-project#13245)
authored- authored
Commits on Feb 20, 2025
- authored
- authored
- authored
- authored