-
ai-infra-notes Public
Reading notes on the open source code of AI infrastructure --- AI Infra 的开源代码阅读笔记 (LLM, DL, HPC, CUDA, SIMD...)
1 UpdatedFeb 22, 2025 -
hpc Public
Learning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc. )
-
cutlass Public
Forked from NVIDIA/cutlassCUDA Templates for Linear Algebra Subroutines
C++ Other UpdatedFeb 21, 2025 -
sglang Public
Forked from sgl-project/sglangSGLang is a fast serving framework for large language models and vision language models.
-
lighteval Public
Forked from huggingface/lightevalLighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
Python MIT License UpdatedFeb 10, 2025 -
pocket-ai Public
A Portable Toolkit for deploying Edge AI and HPC (opencl, vulkan, simd, task scheduling)
-
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedFeb 9, 2025 -
lmdeploy Public
Forked from InternLM/lmdeployLMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Python Apache License 2.0 UpdatedJan 23, 2025 -
mlc-llm Public
Forked from mlc-ai/mlc-llmUniversal LLM Deployment Engine with ML Compilation
Python Apache License 2.0 UpdatedNov 12, 2024 -
-
tensorflow Public
Forked from tensorflow/tensorflowAn Open Source Machine Learning Framework for Everyone
C++ Apache License 2.0 UpdatedJul 24, 2024 -
tflite_micro Public
Forked from tensorflow/tflite-microInfrastructure to enable deployment of ML models to low-power resource-constrained embedded targets (including microcontrollers and digital signal processors).
C++ Apache License 2.0 UpdatedJul 17, 2024 -
TinyNeuralNetwork Public
Forked from alibaba/TinyNeuralNetworkTinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.
Python MIT License UpdatedJun 3, 2024 -
-
ecas Public
ECAS is a library for edge AI computing acceleration.
-
patterns Public
A collection of architectural patterns and design patterns.
-
-
cpy Public
Notes on calling each other between C and python.
C++ Apache License 2.0 UpdatedAug 15, 2021 -
-
deeplearning-paper-notes Public
Reading notes on deep learning papers---深度学习论文阅读笔记 (2013-2018)
-
dlex-cnn Public
DIY - A deep learning framework