High Performance Gateway(L7) for Alipay, SSL offload, Intel QAT/Cavium Nitrox tech, Altera/Xilinx FPGA, AliRedis author, K8S/Kubeflow, AI Beginner since 2024.
-
Alibaba
- Beijing
-
14:48
- 8h ahead
Pinned Loading
-
vllm-project/vllm
vllm-project/vllm PublicA high-throughput and memory-efficient inference and serving engine for LLMs
-
AutoAWQ
AutoAWQ PublicForked from casper-hansen/AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
Python
-
-
llm-compressor
llm-compressor PublicForked from vllm-project/llm-compressor
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
Python
102 contributions in the last year
Day of Week | February Feb | March Mar | April Apr | May May | June Jun | July Jul | August Aug | September Sep | October Oct | November Nov | December Dec | January Jan | February Feb | ||||||||||||||||||||||||||||||||||||||||
Sunday Sun | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Monday Mon | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Tuesday Tue | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Wednesday Wed | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Thursday Thu | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Friday Fri | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Saturday Sat |
Less
No contributions.
Low contributions.
Medium-low contributions.
Medium-high contributions.
High contributions.
More
Activity overview
Contributed to
vllm-project/vllm,
zeroine/cutlass-cute-sample,
neuralmagic/AutoFP8
and 15 other
repositories
Loading
Contribution activity
February 2025
Created 2 repositories
-
AllenDou/TinyZero
Python
This contribution was made on Feb 8
-
AllenDou/open-r1
Python
This contribution was made on Feb 7
Reviewed 1 pull request in 1 repository
vllm-project/vllm
1 pull request
-
[Kernel] W8A16 Int8 inside FusedMoE
This contribution was made on Feb 14