- Beijing, China
- weixiao-huang.github.io
Stars
DLRover: An Automatic Distributed Deep Learning System
OpenID Connect (OIDC) identity and OAuth 2.0 provider with pluggable connectors
This repository hosts the Multi-Cluster Service APIs. Providers can import packages in this repo to ensure their multi-cluster service controller implementations will be compatible with MCS data pl…
A distributed transaction framework, supports workflow, saga, tcc, xa, 2-phase message, outbox patterns, supports many languages.
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
Ongoing research training transformer models at scale
Various distributed Torch benchmarks
The road to hack SysML and become an system expert
The web framework for content-driven websites. ⭐️ Star to support our work!
Fluid, elastic data abstraction and acceleration for BigData/AI applications in cloud. (Project under CNCF)
The central registry of Bazel modules for the Bzlmod external dependency system.
An awesome & curated list of best LLMOps tools for developers
Kubernetes Virtualization API and runtime in order to define and manage virtual machines.
Kata Containers is an open source project and community working to build a standard implementation of lightweight Virtual Machines (VMs) that feel and perform like containers, but provide the workl…
Your defacto guide on monorepos, and in depth feature comparisons of tooling solutions.
A Virtual Machine Monitor for modern Cloud workloads. Features include CPU, memory and device hotplug, support for running Windows and Linux guests, device offload with vhost-user and a minimal com…
Lightweight Virtualization Add-on for Kubernetes
Manages Envoy Proxy as a Standalone or Kubernetes-based Application Gateway
Carbon Language's main repository: documents, design, implementation, and related tools. (NOTE: Carbon Language is experimental; see README)
🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSy…
vCluster - Create fully functional virtual Kubernetes clusters - Each vcluster runs inside a namespace of the underlying k8s cluster. It's cheaper than creating separate full-blown clusters and it …
Curve is a sandbox project hosted by the CNCF Foundation. It's cloud-native, high-performance, and easy to operate. Curve is an open-source distributed storage system for block and shared file stor…
ebpf-go is a pure-Go library to read, modify and load eBPF programs and attach them to various hooks in the Linux kernel.