TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,506 1,112 Updated Feb 21, 2025

SJTU-IPADS / PowerInfer

High-speed Large Language Model Serving for Local Deployment

C++ 8,113 424 Updated Feb 19, 2025

NVIDIA / FasterTransformer

Transformer-related optimization, including BERT and GPT

C++ 6,040 899 Updated Mar 27, 2024

szad670401 / HyperLPR

High-performance Chinese license plate recognition framework based on deep learning.

C++ 5,833 2,028 Updated Oct 20, 2024

OpenNMT / CTranslate2

Fast inference engine for Transformer models

C++ 3,611 324 Updated Feb 10, 2025

bytedance / lightseq

LightSeq: A High Performance Library for Sequence Processing and Generation

C++ 3,250 332 Updated May 16, 2023

pytorch / FBGEMM

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

C++ 1,261 540 Updated Feb 24, 2025

Zhou-sx / yolov5_Deepsort_rknn

Track vehicles and persons on rk3588 / rk3399pro.

C++ 369 88 Updated Apr 8, 2023

grimoire / amirstan_plugin

Useful TensorRT plugins for PyTorch and MMDetection model conversion.

C++ 161 38 Updated Oct 8, 2024

Peter-Chou / libtorch_grpc_serving

PyTorch for training, LibTorch for serving via gRPC

C++ 21 5 Updated Sep 9, 2019

Uhao-P / rk3588_read_video

Various solutions for reading camera and video files on the RK3588

C++ 10 4 Updated Apr 17, 2024

Uhao-P / rk3588_metrics

Tests metrics for RKNN classification and object detection models

C++ 2 Updated May 17, 2024

Uhao-P / protobuf

Forked from protocolbuffers/protobuf

Protocol Buffers - Google's data interchange format

C++ 1 Updated Mar 3, 2021

Uhao-P / tensorrtx

Forked from wang-xinyu/tensorrtx

Implementation of popular deep learning networks with TensorRT network definition API

C++ 1 Updated Jul 1, 2021

Xiangyu Pan Uhao-P

Stars

ggml-org / llama.cpp

nomic-ai / gpt4all

facebookresearch / faiss

ggml-org / ggml

NVIDIA / TensorRT-LLM