2 stars written in C++

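A pure C++ cross-platform LLM acceleration library, callable from Python. ChatGLM-6B-class models can reach 10,000+ tokens/s on a single GPU. Supports GLM, LLaMA, and MOSS base models, and runs smoothly on mobile phones.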

C++ · 3,348 stars · 346 forks · Updated Dec 17, 2024

INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model

C++ · 1,442 stars · 100 forks · Updated Aug 7, 2024