Stars
📚 200+ Tensor/CUDA Core kernels, including ⚡️flash-attn-mma and ⚡️hgemm with WMMA, MMA, and CuTe (reaching 98%–100% of cuBLAS/FA2 TFLOPS 🎉🎉).
The Triton TensorRT-LLM Backend
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Layout-guided multi-view driving-scene video generation with a latent diffusion model
Introductory deep learning tutorials and selected articles (Deep Learning Tutorial)