15 stars written in Cuda

LLM training in simple, raw C/CUDA

Cuda 25,145 2,876 Updated Oct 2, 2024

Tile primitives for speedy kernels

Cuda 1,963 100 Updated Jan 26, 2025

How to optimize some algorithms in CUDA.

Cuda 1,847 153 Updated Jan 26, 2025

[ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl

Cuda 1,694 447 Updated Oct 9, 2023

Sample codes for my CUDA programming book

Cuda 1,626 333 Updated Jul 27, 2023

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, s…

Cuda 892 141 Updated Jul 29, 2023

RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing …

Cuda 823 198 Updated Jan 25, 2025

Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/)

Cuda 664 244 Updated Aug 19, 2024

Several optimization methods of half-precision general matrix multiplication (HGEMM) using tensor core with WMMA API and MMA PTX instruction.

Cuda 334 69 Updated Sep 8, 2024

CUDA Matrix Multiplication Optimization

Cuda 155 14 Updated Jul 19, 2024

Efficient Top-K implementation on the GPU

Cuda 150 21 Updated Apr 9, 2019

GPU-Accelerated Lossless Data Compressors Survey

Cuda 112 11 Updated Sep 10, 2020

Cuda 13 3 Updated Dec 1, 2023

Cuda 13 1 Updated Sep 14, 2021