Skip to content
View xianml's full-sized avatar

Block or report xianml

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
16 stars written in C++
Clear filter

LLM inference in C/C++

C++ 70,056 10,116 Updated Dec 31, 2024

Cross-platform, customizable ML solutions for live and streaming media.

C++ 28,116 5,210 Updated Dec 21, 2024

Productive, portable, and performant GPU programming in Python.

C++ 26,492 2,318 Updated Dec 23, 2024

Distribute and run LLMs with a single file.

C++ 21,077 1,082 Updated Dec 14, 2024

Official inference framework for 1-bit LLMs

C++ 12,527 879 Updated Dec 20, 2024

Tensor library for machine learning

C++ 11,461 1,070 Updated Dec 23, 2024

Cataclysm - Dark Days Ahead. A turn-based survival game set in a post-apocalyptic world.

C++ 10,846 4,229 Updated Jan 2, 2025

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,049 1,044 Updated Dec 26, 2024

Redot Engine – Multi-platform 2D and 3D game engine

C++ 4,970 229 Updated Dec 29, 2024

Stable Diffusion and Flux in pure C/C++

C++ 3,649 319 Updated Dec 28, 2024

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 2,300 130 Updated Dec 26, 2024

chDB is an in-process OLAP SQL Engine 🚀 powered by ClickHouse

C++ 2,213 77 Updated Dec 30, 2024

A SQLite extension for efficient vector search, based on Faiss!

C++ 1,771 65 Updated May 5, 2024

vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)

C++ 844 121 Updated Dec 20, 2024

A scalable inference server for models optimized with OpenVINO™

C++ 689 212 Updated Dec 31, 2024

C++ builds C++

C++ 24 Updated Nov 14, 2024