Stars
Collect some CS textbooks for learning.
GLake: optimizing GPU memory management and IO transmission.
Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
Implementations of Buffets, which are efficient, composable idioms for implementing Explicit Decoupled Data Orchestration.
Example models using DeepSpeed
A reading list for homomorphic encryption
A reading list for deep graph learning acceleration.
A list of tutorials, paper, talks, and open-source projects for emerging compiler and architecture
Code for ACL2022 publication Transkimmer: Transformer Learns to Layer-wise Skim
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
A tool box for MindSpore users to enhance model security and trustworthiness.
Libraries for finite field, elliptic curve, and polynomial arithmetic
NVlabs / cub
Forked from NVIDIA/cubTHIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.
A Rust library for the Marlin preprocessing zkSNARK
Neural network graphs and training metrics for PyTorch, Tensorflow, and Keras.
header only, dependency-free deep learning framework in C++14