-
Tsinghua University--> Beihang Univerisity
- Haidian, Beijing
- https://zhangmenghao.github.io/
- https://orcid.org/0000-0001-5274-5512
Stars
ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale
Lumina is a user-friendly tool to test the correctness and performance of hardware network stacks.
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Large Language Model (LLM) Systems Paper List
PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for evaluation of training and inference platforms.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Zeta is a distributed platform for developing and deploying complex, elastic, and highly available multi-tenant network services.
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
NVIDIA Linux open GPU kernel module source
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
Transformer related optimization, including BERT, GPT
eBPF implementation that runs on top of Windows
A series of large language models developed by Baichuan Intelligent Technology
A platform for building proxies to bypass network restrictions.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Dynolog is a telemetry daemon for performance monitoring and tracing. It exports metrics from different components in the system like the linux kernel, CPU, disks, Intel PT, GPUs etc. Dynolog also …
Samples for CUDA Developers which demonstrates features in CUDA Toolkit
《Machine Learning Systems: Design and Implementation》- Chinese Version