Skip to content
View zhangmenghao's full-sized avatar

Block or report zhangmenghao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Kolors Team

Python 4,252 322 Updated Nov 13, 2024

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

6,729 200 Updated Mar 4, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 12,620 842 Updated Mar 11, 2025

DLRover: An Automatic Distributed Deep Learning System

Python 1,363 171 Updated Mar 11, 2025

CUDA checkpoint and restore utility

C 304 15 Updated Jan 27, 2025

ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale

C++ 326 124 Updated Feb 23, 2025

Lumina is a user-friendly tool to test the correctness and performance of hardware network stacks.

Python 21 6 Updated Jan 8, 2024

Benchmark Test Suite for RDMA Networks

C++ 53 4 Updated Apr 15, 2023

Checkpoint/Restore tool

C 3,122 633 Updated Mar 10, 2025

Initializer for KServe Cluster

Shell 1 1 Updated Jul 29, 2024

P4 codes for research projects

P4 209 57 Updated Nov 3, 2024

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 140,991 28,240 Updated Mar 11, 2025

Large Language Model (LLM) Systems Paper List

809 32 Updated Mar 11, 2025

PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for evaluation of training and inference platforms.

Python 131 63 Updated Mar 11, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 13,302 2,726 Updated Mar 11, 2025

Zeta is a distributed platform for developing and deploying complex, elastic, and highly available multi-tenant network services.

C 20 10 Updated Mar 31, 2023

nsfc - 国家自然科学基金项目LaTeX模版(面青地)

TeX 445 125 Updated Mar 5, 2025

NCCL Profiling Kit

Python 127 12 Updated Jul 1, 2024

Microsoft Collective Communication Library

61 6 Updated Nov 23, 2024

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.

LLVM 31,334 12,943 Updated Mar 11, 2025

NVIDIA Linux open GPU kernel module source

C 15,602 1,358 Updated Mar 3, 2025

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,671 1,146 Updated Mar 11, 2025

Transformer related optimization, including BERT, GPT

C++ 6,076 901 Updated Mar 27, 2024

eBPF implementation that runs on top of Windows

C 3,104 249 Updated Mar 11, 2025

A series of large language models developed by Baichuan Intelligent Technology

Python 4,125 301 Updated Nov 8, 2024

A platform for building proxies to bypass network restrictions.

Go 30,609 4,762 Updated Mar 5, 2025

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 87,785 23,569 Updated Mar 11, 2025
Next