Skip to content
View tspeterkim's full-sized avatar

Organizations

@EnzymeAD

Block or report tspeterkim

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Tutorials on tinygrad

345 26 Updated Feb 26, 2025

Fastest kernels written from scratch

Cuda 186 24 Updated Feb 15, 2025

Seamless operability between C++11 and Python

C++ 16,250 2,141 Updated Mar 3, 2025

Run compilers interactively from your web browser and interact with the assembly

TypeScript 17,096 1,819 Updated Mar 1, 2025

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.

LLVM 31,188 12,875 Updated Mar 5, 2025

Writing FLUX in Triton

Python 32 8 Updated Sep 22, 2024

Nvidia Instruction Set Specification Generator

Python 251 11 Updated Jul 9, 2024

Spike, a RISC-V ISA Simulator

C 2,603 907 Updated Feb 27, 2025

Official inference repo for FLUX.1 models

Python 20,581 1,447 Updated Feb 6, 2025

UNet diffusion model in pure CUDA

Cuda 598 27 Updated Jun 28, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 40,261 6,028 Updated Mar 5, 2025

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 5,855 536 Updated Dec 14, 2024

A fully compliant RISC-V computer made inside the game Terraria

Rust 3,524 45 Updated Jul 31, 2024
Jupyter Notebook 86 5 Updated Feb 29, 2024

An ML Systems Onboarding list

724 25 Updated Jan 24, 2025

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 14,221 1,165 Updated May 23, 2024

Open-source E-ink monitor. Mirror of https://gitlab.com/zephray/glider

C 1,756 48 Updated Jul 4, 2024

Tile primitives for speedy kernels

Cuda 2,098 120 Updated Mar 5, 2025

A minimal GPU design in Verilog to learn how GPUs work from the ground up

SystemVerilog 7,910 606 Updated Aug 18, 2024

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 28,192 3,226 Updated Mar 5, 2025

NAND is a logic simulator suite made entirely from NAND gates

TypeScript 557 15 Updated Aug 25, 2024

The official PyTorch implementation of L2CS-Net for gaze estimation and tracking

Python 368 89 Updated Feb 2, 2024

A toolkit for making real world machine learning and data analysis applications in C++

C++ 13,815 3,398 Updated Mar 4, 2025

Fast CUDA matrix multiplication from scratch

Cuda 653 86 Updated Dec 28, 2023
Cuda 151 41 Updated Mar 3, 2025

LLM training in simple, raw C/CUDA

Cuda 25,911 2,974 Updated Oct 2, 2024

Development repository for the Triton language and compiler

MLIR 14,716 1,834 Updated Mar 5, 2025
Python 11 Updated Mar 30, 2024
Python 242 3 Updated Mar 20, 2024

Puzzles for learning Triton

Jupyter Notebook 1,463 106 Updated Nov 18, 2024
Next