Lists (4)
Sort Name ascending (A-Z)
Stars
A tool for bandwidth measurements on NVIDIA GPUs.
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
A curated awesome list of lists of interview questions. Feel free to contribute! 🎓
Windows version of NVIDIA's NCCL ('Nickel') for multi-GPU training - please use https://github.com/NVIDIA/nccl for changes.
Windows Calculator: A simple yet powerful calculator that ships with Windows
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
a light-weighted, integrated trading/backtesting system/platform(综合量化交易回测系统/平台)
Optimized primitives for collective multi-GPU communication
A baseline repository of Auto-Parallelism in Training Neural Networks
A curated list of awesome projects and papers for distributed training or inference
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
A massively spiffy yet delicately unobtrusive compression library.
📡 PoC auto collect from GitHub.
UNIX-like reverse engineering framework and command-line toolset
SAFE: Self-Attentive Function Embeddings for binary similarity