Skip to content
View SheaCai's full-sized avatar

Highlights

  • Pro

Block or report SheaCai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Intel staging area for llvm.org contribution. Home for Intel LLVM-based projects.

LLVM 1,298 760 Updated Mar 12, 2025

This is the implementation of the paper [Optimus: Towards Optimal Layer-Fusion on Deep Learning Processors].

Python 9 6 Updated May 10, 2021

for Data Science class on Coursera

493 145 Updated Nov 26, 2019

Module to Automatically maximize the utilization of GPU resources in a Kubernetes cluster through real-time dynamic partitioning and elastic quotas - Effortless optimization at its finest!

Go 647 36 Updated Apr 21, 2024

《Machine Learning Systems: Design and Implementation》- Chinese Version

TeX 4,300 450 Updated Apr 13, 2024

System for AI Education Resource.

Python 3,887 485 Updated Oct 25, 2024

📚 技术面试必备基础知识、Leetcode、计算机操作系统、计算机网络、系统设计

179,291 51,205 Updated Aug 21, 2024

《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀

Shell 54,608 11,872 Updated Mar 5, 2025

Benchmarks for popular CNN models

Python 2,526 407 Updated Sep 25, 2017

The lcc retargetable ANSI C compiler

C 2,079 453 Updated Oct 6, 2024
Racket 86 6 Updated Jun 30, 2022

A scheduler for spatial DNN accelerators that generate high-performance schedules in one shot using mixed integer programming (MIP)

Python 79 19 Updated Aug 28, 2023

A template project for beginning new Chisel work

Scala 624 187 Updated Jan 30, 2025

A Spatial Accelerator Generation Framework for Tensor Algebra.

Verilog 55 9 Updated Dec 3, 2021

IC implementation of Systolic Array for TPU

Verilog 197 26 Updated Oct 21, 2024

Accelerate Linear Equation System Solver on DE1-SoC development Board

C 1 Updated Aug 7, 2019

Vitis AI is Xilinx’s development stack for AI inference on Xilinx hardware platforms, including both edge devices and Alveo cards.

Python 1,554 642 Updated Sep 12, 2024

Transformer related optimization, including BERT, GPT

C++ 6,077 901 Updated Mar 27, 2024
Python 70 13 Updated Mar 22, 2020

Count the MACs / FLOPs of your PyTorch model.

Python 4,962 529 Updated Jul 8, 2024

Model summary in PyTorch similar to `model.summary()` in Keras

Python 4,034 415 Updated Mar 2, 2024

A compiler from AI model to RTL (Verilog) accelerator in FPGA hardware with auto design space exploration.

Verilog 412 102 Updated Dec 2, 2019

Intermediate Language (IL) for Hardware Accelerator Generators

Rust 518 53 Updated Mar 11, 2025

Tengine is a lite, high performance, modular inference engine for embedded device

C++ 4,449 971 Updated Mar 6, 2025

XLS: Accelerated HW Synthesis

C++ 1,252 187 Updated Mar 11, 2025

A home for Genesis2 sources.

Perl 41 12 Updated Feb 18, 2025

Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm

C++ 200 28 Updated Dec 11, 2024

A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.

C++ 979 163 Updated Sep 19, 2024
Next