Skip to content
View TKH666's full-sized avatar

Highlights

  • Pro

Block or report TKH666

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

4 stars written in Cuda
Clear filter

📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).

Cuda 2,158 229 Updated Jan 27, 2025

Sample codes for my CUDA programming book

Cuda 1,626 333 Updated Jul 27, 2023

[ICML 2022] "Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets" by Tianlong Chen, Xuxi Chen, Xiaolong Ma, Yanzhi Wang, Zhangyang Wang.

Cuda 32 6 Updated Apr 9, 2023

FZ-GPU: A Fast and High-Ratio Lossy Compressor for Scientific Data on GPUs

Cuda 11 4 Updated Sep 26, 2023