AllenDou

High Performance Gateway (L7) for Alipay, SSL offload, Intel QAT/Cavium Nitrox tech, Altera/Xilinx FPGA, AliRedis author, K8S/Kubeflow, AI Beginner since 2024.

19 followers · 6 following

Alibaba
Beijing

Pinned

vllm-project/vllm Public

A high-throughput and memory-efficient inference and serving engine for LLMs

Python · 38.6k stars · 5.8k forks

AutoAWQ Public

Forked from casper-hansen/AutoAWQ

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python

AutoFP8 Public

Forked from neuralmagic/AutoFP8

Python

llm-compressor Public

Forked from vllm-project/llm-compressor

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python

102 contributions in the last year

Activity overview

Contributed to vllm-project/vllm, zeroine/cutlass-cute-sample, neuralmagic/AutoFP8 and 15 other repositories

Contribution activity

February 2025

Created 2 repositories

AllenDou/TinyZero Python
This contribution was made on Feb 8
AllenDou/open-r1 Python
This contribution was made on Feb 7

Reviewed 1 pull request in 1 repository

vllm-project/vllm 1 pull request

[Kernel] W8A16 Int8 inside FusedMoE
This contribution was made on Feb 14