Skip to content
View zhouhao03's full-sized avatar

Block or report zhouhao03

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 5,771 527 Updated Dec 14, 2024

Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure

C++ 804 329 Updated Feb 1, 2025

A high performance, high expansion, easy to use framework for AI application. 为AI应用的开发者提供一套统一的高性能、易用的编程框架,快速基于AI全栈服务、开发跨端边云的AI行业应用,支持GPU,NPU加速。

C++ 144 41 Updated Jun 6, 2024

A library for high performance deep learning inference on NVIDIA GPUs.

C++ 552 67 Updated Jan 29, 2022

Python Single Object Tracking Evaluation

Python 426 69 Updated Jul 20, 2019

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 65,433 6,992 Updated Feb 1, 2025

SenseTime Research platform for single object tracking, implementing algorithms like SiamRPN and SiamMask.

Python 4,466 1,110 Updated Nov 12, 2023

Build smaller, faster, and more secure desktop and mobile applications with a web frontend.

Rust 88,972 2,710 Updated Feb 1, 2025

Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.

Python 28,886 3,425 Updated Jan 22, 2025

A natural language interface for computers

Python 58,106 4,985 Updated Jan 24, 2025

2021年最新总结,从程序员到CTO,从专业走向卓越,分享大牛企业内部pdf与PPT

11,149 3,036 Updated May 20, 2024

A cross-platform video structuring (video analysis) framework. If you find it helpful, please give it a star: ) 跨平台的视频结构化(视频分析)框架,觉得有帮助的请给个星星 : )

C++ 1,554 222 Updated Aug 27, 2024

TensorRT+YOLO系列的 多路 多卡 多实例 并行视频分析处理案例

C++ 265 50 Updated Aug 1, 2024

Common utilities for ONNX converters

Python 257 67 Updated Dec 3, 2024
C++ 23 3 Updated Apr 25, 2023

My TensorRT implimention for [ECCV2018] Distractor-aware Siamese Networks for Visual Object Tracking

Jupyter Notebook 2 Updated Oct 19, 2022
C++ 65 17 Updated Apr 1, 2022

A collection of pre-trained, state-of-the-art models in the ONNX format

Jupyter Notebook 8,200 1,423 Updated Apr 30, 2024

Actively maintained ONNX Optimizer

C++ 665 92 Updated Jan 27, 2025

Full reimplementation of siamese rpn, has 0.24 eao on vot2017.

Python 224 44 Updated Sep 9, 2021

CUDA Kernel Benchmarking Library

Cuda 551 70 Updated Nov 20, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,274 1,088 Updated Feb 1, 2025

A Toolkit to Help Optimize Large Onnx Model

Python 153 10 Updated May 16, 2024

yolort is a runtime stack for yolov5 on specialized accelerators such as tensorrt, libtorch, onnxruntime, tvm and ncnn.

Python 728 152 Updated Dec 23, 2024

Simplify your onnx model

C++ 3,963 388 Updated Sep 3, 2024

Simplify your onnx model

Python 1 Updated Apr 27, 2022

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 5,392 475 Updated Jan 27, 2025
Jupyter Notebook 78 9 Updated Sep 19, 2023
Next