lingzerowan

Stars

AI infra

6 repositories

SGLang is a fast serving framework for large language models and vision language models.

Python 8,244 801 Updated Jan 31, 2025

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 1,197 97 Updated Jan 24, 2025

Fast and memory-efficient exact attention

Python 15,242 1,439 Updated Jan 30, 2025

Development repository for the Triton language and compiler

C++ 14,221 1,752 Updated Jan 31, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 35,704 5,411 Updated Jan 31, 2025

Notes on knowledge and interview questions for large language model (LLM) algorithm (application) engineers

HTML 4,924 569 Updated Oct 22, 2024