Skip to content
View chhzh123's full-sized avatar

Highlights

  • Pro

Organizations

@cornell-zhang

Block or report chhzh123

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

DL Compiler

Deep learning compilers
22 repositories

Training and serving large-scale neural networks with auto parallelization.

Python 3,103 361 Updated Dec 9, 2023

A list of awesome compiler projects and papers for tensor computation and deep learning.

2,490 307 Updated Oct 19, 2024

Reinforcement learning environments for compiler and program optimization tasks

Python 933 130 Updated Oct 9, 2024
Python 195 57 Updated Mar 28, 2023

A retargetable MLIR-based machine learning compiler and runtime toolkit.

C++ 3,003 667 Updated Feb 25, 2025

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 11,233 2,158 Updated Feb 1, 2025

High-performance automatic differentiation of LLVM and MLIR.

LLVM 1,339 118 Updated Feb 22, 2025

Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisation

1,498 165 Updated Jan 4, 2025

The Tensor Algebra SuperOptimizer for Deep Learning

C++ 696 92 Updated Jan 26, 2023

Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators

Python 107 9 Updated Oct 26, 2022

DietCode Code Release

Cuda 61 9 Updated Jul 21, 2022

👨‍💻 My PhD.

C++ 187 34 Updated Oct 5, 2022

A language and compiler for irregular tensor programs.

C++ 136 10 Updated Nov 29, 2024
MLIR 405 72 Updated Feb 25, 2025

PlaidML is a framework for making deep learning work everywhere.

C++ 4,586 398 Updated Jul 23, 2023

Machine learning compiler based on MLIR for Sophgo TPU.

C++ 676 168 Updated Feb 24, 2025

A list of tutorials, paper, talks, and open-source projects for emerging compiler and architecture

434 35 Updated Jan 15, 2025

A Python-level JIT compiler designed to make unmodified PyTorch programs faster.

Python 1,029 126 Updated Apr 17, 2024

Powering AWS purpose-built machine learning chips. Blazing fast and cost effective, natively integrated into PyTorch and TensorFlow and integrated with your favorite AWS services

Python 495 157 Updated Feb 6, 2025

Torch Frontend for IREE

Python 25 10 Updated Dec 21, 2023

Train very large language models in Jax.

Python 202 17 Updated Oct 21, 2023