Starred repositories
🔍 A Hex Editor for Reverse Engineers, Programmers and people who value their retinas when working at 3 AM.
Collection of various algorithms in mathematics, machine learning, computer science and physics implemented in C++ for educational purposes.
Productive, portable, and performant GPU programming in Python.
Event-driven network library for multi-threaded Linux server in C++11
Development repository for the Triton language and compiler
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
A simple C++11 Thread Pool implementation
Ethereum miner with OpenCL, CUDA and stratum support
Self-hosted crypto trading bot (automated high frequency market making) written in C++
Optimized primitives for collective multi-GPU communication
纯c++的全平台llm加速库,支持python调用,chatglm-6B级模型单卡可达10000+token / s,支持glm, llama, moss基座,手机端流畅运行
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
A machine learning compiler for GPUs, CPUs, and ML accelerators
The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.
The minimal opencv for Android, iOS, ARM Linux, Windows, Linux, MacOS, HarmonyOS, WebAssembly, watchOS, tvOS, visionOS
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.
Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training
QPanda 2 is an open source quantum computing framework developed by OriginQC that can be used to build, run, and optimize quantum algorithms.