Skip to content
View Uhao-P's full-sized avatar

Block or report Uhao-P

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
18 stars written in C++
Clear filter

LLM inference in C/C++

C++ 75,115 10,855 Updated Feb 24, 2025

GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.

C++ 72,584 7,919 Updated Feb 21, 2025

A library for efficient similarity search and clustering of dense vectors.

C++ 33,193 3,762 Updated Feb 22, 2025

Tensor library for machine learning

C++ 11,948 1,143 Updated Feb 12, 2025

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,506 1,112 Updated Feb 21, 2025

High-speed Large Language Model Serving for Local Deployment

C++ 8,113 424 Updated Feb 19, 2025

Transformer related optimization, including BERT, GPT

C++ 6,040 899 Updated Mar 27, 2024

基于深度学习高性能中文车牌识别 High Performance Chinese License Plate Recognition Framework.

C++ 5,833 2,028 Updated Oct 20, 2024

Fast inference engine for Transformer models

C++ 3,611 324 Updated Feb 10, 2025

LightSeq: A High Performance Library for Sequence Processing and Generation

C++ 3,250 332 Updated May 16, 2023

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

C++ 1,261 540 Updated Feb 24, 2025

Track vehicles and persons on rk3588 / rk3399pro.

C++ 369 88 Updated Apr 8, 2023

Useful tensorrt plugin. For pytorch and mmdetection model conversion.

C++ 161 38 Updated Oct 8, 2024

pytorch during training, libtorch during serving via gRPC

C++ 21 5 Updated Sep 9, 2019

rk3588 various solutions for reading camera and video files

C++ 10 4 Updated Apr 17, 2024

测试rknn分类模型与目标检测模型指标

C++ 2 Updated May 17, 2024

Protocol Buffers - Google's data interchange format

C++ 1 Updated Mar 3, 2021

Implementation of popular deep learning networks with TensorRT network definition API

C++ 1 Updated Jul 1, 2021