-
Statistics Department of JNU
- Guangzhou, China
-
11:21
(UTC +08:00) - https://github.com/DefTruth
- https://www.zhihu.com/people/qyjdef
Pinned Loading
-
lite.ai.toolkit
lite.ai.toolkit Public๐ A lite C++ toolkit of 100+ Awesome AI models, support ORT, MNN, NCNN, TNN and TensorRT. ๐๐
-
vllm-project/vllm
vllm-project/vllm PublicA high-throughput and memory-efficient inference and serving engine for LLMs
-
Awesome-LLM-Inference
Awesome-LLM-Inference Public๐A curated list of Awesome LLM/VLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism, etc. ๐๐
-
CUDA-Learn-Notes
CUDA-Learn-Notes Public๐Tensor/CUDA Cores, ๐150+ CUDA Kernels, โก๏ธโก๏ธtoy-hgemm library with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS ๐๐).
-
Awesome-Diffusion-Inference
Awesome-Diffusion-Inference Public๐A curated list of Awesome Diffusion Inference Papers with codes, such as Sampling, Caching, Multi-GPUs, etc. ๐๐
-
hgemm-tensorcores-mma
hgemm-tensorcores-mma Publicโก๏ธWrite HGEMM from scratch using Tensor Cores with WMMA, MMA PTX and CuTe API. ๐๐
Cuda 30
If the problem persists, check the GitHub status page or contact support.