Skip to content
View hedes1992's full-sized avatar

Block or report hedes1992

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

6 stars written in Cuda
Clear filter

📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).

Cuda 2,855 293 Updated Mar 16, 2025

Sample codes for my CUDA programming book

Cuda 1,667 338 Updated Feb 15, 2025

Deformable ConvNets V2 (DCNv2) in PyTorch

Cuda 1,458 231 Updated Nov 18, 2022

Introduction to Parallel Programming class code

Cuda 1,313 1,139 Updated Jun 27, 2022

Distribution-Aware Coordinate Representation for Human Pose Estimation

Cuda 560 82 Updated May 17, 2024

Reproduce of CornerNet

Cuda 22 1 Updated Feb 28, 2019