Skip to content
View tylsun's full-sized avatar

Block or report tylsun

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
15 results for source starred repositories
Clear filter

Applied AI experiments and examples for PyTorch

Python 228 22 Updated Feb 25, 2025

Ecosystem of libraries and tools for writing and executing fast GPU code fully in Rust.

Rust 3,638 148 Updated Feb 8, 2025

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,537 1,118 Updated Feb 26, 2025

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 14,196 1,164 Updated May 23, 2024

Tile primitives for speedy kernels

Cuda 2,081 119 Updated Feb 26, 2025

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 28,134 3,198 Updated Feb 26, 2025

LLM training in simple, raw C/CUDA

Cuda 25,811 2,957 Updated Oct 2, 2024

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

C 7,001 1,938 Updated Feb 26, 2025

Step-by-step optimization of CUDA SGEMM

Cuda 288 44 Updated Mar 30, 2022

Inference Llama 2 in one file of pure C

C 18,084 2,199 Updated Aug 6, 2024

CUDA Templates for Linear Algebra Subroutines

C++ 6,760 1,109 Updated Feb 26, 2025

LLM inference in C/C++

C++ 75,376 10,896 Updated Feb 26, 2025

Tensor library for machine learning

C++ 11,972 1,146 Updated Feb 26, 2025