submartingales

Starred repositories

17 stars written in C++

LLM inference in C/C++

C++ 69,385 9,994 Updated Dec 18, 2024

ClickHouse® is a real-time analytics DBMS

C++ 38,099 6,971 Updated Dec 18, 2024

Port of OpenAI's Whisper model in C/C++

C++ 36,361 3,723 Updated Dec 17, 2024

A library for efficient similarity search and clustering of dense vectors.

C++ 31,979 3,675 Updated Dec 17, 2024

The Serenity Operating System 🐞

C++ 30,862 3,194 Updated Dec 18, 2024

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++ 15,005 2,957 Updated Dec 18, 2024

Development repository for the Triton language and compiler

C++ 13,713 1,683 Updated Dec 18, 2024

Tensor library for machine learning

C++ 11,381 1,062 Updated Dec 17, 2024

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 10,375 1,179 Updated Dec 1, 2024

Enabling the Windows Subsystem for Linux to include support for Wayland and X server related scenarios

C++ 10,303 310 Updated Oct 2, 2024

OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference

C++ 7,464 2,338 Updated Dec 18, 2024

Transformer related optimization, including BERT, GPT

C++ 5,938 895 Updated Mar 27, 2024

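A pure C++ cross-platform LLM acceleration library, callable from Python; ChatGLM-6B-class models can reach 10,000+ tokens/s on a single GPU; supports GLM, LLaMA, and MOSS base models; runs smoothly on mobile devices.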

C++ 3,346 345 Updated Dec 17, 2024

C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)

C++ 2,956 335 Updated Jul 31, 2024

Patterns and behaviors for GPU computing

C++ 1,684 280 Updated Jun 26, 2022

LLM deployment project based on MNN.

C++ 1,496 167 Updated Nov 5, 2024

Security and Privacy Risk Simulator for Machine Learning (arXiv:2312.17667)

C++ 371 60 Updated May 10, 2024