Lists (3)
Sort Name ascending (A-Z)
Stars
7
stars
written in Cuda
Clear filter
FlashInfer: Kernel Library for LLM Serving
llama3.cuda is a pure C/CUDA implementation for Llama 3 model.
📝 Some source code about matrix multiplication implementation on CUDA