- Seattle, WA
-
03:32
(UTC -08:00) - https://homes.cs.washington.edu/~zhye/
-
flashinfer-dev Public
Forked from flashinfer-ai/flashinferFlashInfer: Kernel Library for LLM Serving
Cuda Apache License 2.0 UpdatedDec 16, 2024 -
xgrammar Public
Forked from mlc-ai/xgrammarEfficient, Flexible and Portable Structured Generation
C++ Apache License 2.0 UpdatedNov 27, 2024 -
-
open-gpu-kernel-modules Public
Forked from NVIDIA/open-gpu-kernel-modulesNVIDIA Linux open GPU kernel module source
C Other UpdatedSep 14, 2024 -
sglang Public
Forked from sgl-project/sglangSGLang is a fast serving framework for large language models and vision language models.
Python Apache License 2.0 UpdatedSep 12, 2024 -
triton Public
Forked from triton-lang/tritonDevelopment repository for the Triton language and compiler
C++ MIT License UpdatedAug 22, 2024 -
-
cutlass Public
Forked from NVIDIA/cutlassCUDA Templates for Linear Algebra Subroutines
C++ Other UpdatedJul 24, 2024 -
mirage Public
Forked from mirage-project/mirageA multi-level tensor algebra superoptimizer
-
texmacs Public
Forked from texmacs/texmacsSource Code of GNU TeXmacs, Developers Guide ==>
Tcl GNU General Public License v3.0 UpdatedApr 24, 2024 -
mlx Public
Forked from ml-explore/mlxMLX: An array framework for Apple silicon
C++ MIT License UpdatedFeb 19, 2024 -
pbrt-v4 Public
Forked from mmp/pbrt-v4Source code to pbrt, the ray tracer described in the forthcoming 4th edition of the "Physically Based Rendering: From Theory to Implementation" book.
C++ Apache License 2.0 UpdatedFeb 17, 2024 -
metal-benchmarks Public
Forked from philipturner/metal-benchmarksApple GPU microarchitecture
Metal MIT License UpdatedJan 31, 2024 -
nccl Public
Forked from NVIDIA/ncclOptimized primitives for collective multi-GPU communication
C++ Other UpdatedJan 9, 2024 -
flashinfer-ai.github.io Public
Forked from flashinfer-ai/flashinfer-ai.github.ioProject website of FlashInfer project
HTML UpdatedJan 6, 2024 -
tvm Public
Forked from apache/tvmOpen deep learning compiler stack for cpu, gpu and specialized accelerators
Python Apache License 2.0 UpdatedJan 2, 2024 -
punica Public
Forked from punica-ai/punicaServing multiple LoRA finetuned LLM as one
-
mlc-llm Public
Forked from mlc-ai/mlc-llmEnable everyone to develop, optimize and deploy AI models natively on everyone's devices.
-
uwsampl.github.io Public
Forked from uwsampl/uwsampl.github.ioThe UW SAMPL group's website.
HTML Other UpdatedSep 5, 2023 -
-
-
-
relax-sparse Public
Forked from tlc-pack/relaxTemp repo for prototyping relax(relay next), the effort will be upstreamed. We use the wiki pages on this repo to host design docs.
Python Apache License 2.0 UpdatedJun 10, 2023 -
-
web-llm Public
Forked from mlc-ai/web-llmBringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.
Python Apache License 2.0 UpdatedApr 21, 2023 -
tvm-rfcs Public
Forked from apache/tvm-rfcsA home for the final text of all TVM RFCs.
Apache License 2.0 UpdatedApr 19, 2023 -
smoothquant Public
Forked from mit-han-lab/smoothquantSmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
Python MIT License UpdatedApr 12, 2023 -
-
web-stable-diffusion Public
Forked from mlc-ai/web-stable-diffusionBringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.
Jupyter Notebook Apache License 2.0 UpdatedMar 17, 2023 -
bibfetch Public
Fetch bibtex entries from academic search engines like dblp.