Skip to content

Navigation Menu

Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

yzh119 Follow

Overview Repositories 151 Projects 0 Packages 0 Stars 760

More

Overview
Repositories
Projects
Packages
Stars

yzh119

Follow

Zihao Ye yzh119

Follow

Please move faster.

679 followers · 149 following

@flashinfer-ai
Seattle, WA
03:32 (UTC -08:00)
https://homes.cs.washington.edu/~zhye/

Achievements

Achievements

Organizations

Block or report yzh119

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Add an optional note:

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Overview Repositories 151 Projects 0 Packages 0 Stars 760

More

Overview
Repositories
Projects
Packages
Stars

Type All

Select type

All Sources Forks Archived Can be sponsored Mirrors Templates

Language All

Select language

All Cuda C++ C Python Tcl Metal HTML Shell Groovy Jupyter Notebook Go SystemVerilog Haskell Java Verilog CSS Rust Vim Script CMake Scala Julia JavaScript

Sort Last updated

Select order

Last updated Name Stars

flashinfer-dev Public
Forked from flashinfer-ai/flashinfer

FlashInfer: Kernel Library for LLM Serving

Cuda Apache License 2.0 Updated Dec 16, 2024
xgrammar Public
Forked from mlc-ai/xgrammar

Efficient, Flexible and Portable Structured Generation

C++ Apache License 2.0 Updated Nov 27, 2024
NetHack Public
Forked from NetHack/NetHack

Official NetHack Git Repository

C Updated Nov 3, 2024
open-gpu-kernel-modules Public
Forked from NVIDIA/open-gpu-kernel-modules

NVIDIA Linux open GPU kernel module source

C Other Updated Sep 14, 2024
sglang Public
Forked from sgl-project/sglang

SGLang is a fast serving framework for large language models and vision language models.

Python Apache License 2.0 Updated Sep 12, 2024
triton Public
Forked from triton-lang/triton

Development repository for the Triton language and compiler

C++ MIT License Updated Aug 22, 2024
kernels Public
Forked from triton-lang/kernels

Python Updated Aug 19, 2024
cutlass Public
Forked from NVIDIA/cutlass

CUDA Templates for Linear Algebra Subroutines

C++ Other Updated Jul 24, 2024
mirage Public
Forked from mirage-project/mirage

A multi-level tensor algebra superoptimizer

C++ 2 Apache License 2.0 Updated May 14, 2024
texmacs Public
Forked from texmacs/texmacs

Source Code of GNU TeXmacs, Developers Guide ==>

Tcl GNU General Public License v3.0 Updated Apr 24, 2024
mlx Public
Forked from ml-explore/mlx

MLX: An array framework for Apple silicon

C++ MIT License Updated Feb 19, 2024
pbrt-v4 Public
Forked from mmp/pbrt-v4

Source code to pbrt, the ray tracer described in the forthcoming 4th edition of the "Physically Based Rendering: From Theory to Implementation" book.

C++ Apache License 2.0 Updated Feb 17, 2024
metal-benchmarks Public
Forked from philipturner/metal-benchmarks

Apple GPU microarchitecture

Metal MIT License Updated Jan 31, 2024
nccl Public
Forked from NVIDIA/nccl

Optimized primitives for collective multi-GPU communication

C++ Other Updated Jan 9, 2024
flashinfer-ai.github.io Public
Forked from flashinfer-ai/flashinfer-ai.github.io

Project website of FlashInfer project

HTML Updated Jan 6, 2024
tvm Public
Forked from apache/tvm

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Python Apache License 2.0 Updated Jan 2, 2024
punica Public
Forked from punica-ai/punica

Serving multiple LoRA finetuned LLM as one

Python 2 Apache License 2.0 Updated Nov 27, 2023
mlc-llm Public
Forked from mlc-ai/mlc-llm

Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.

Python 1 Apache License 2.0 Updated Nov 23, 2023
uwsampl.github.io Public
Forked from uwsampl/uwsampl.github.io

The UW SAMPL group's website.

HTML Other Updated Sep 5, 2023
llm-perf-bench Public
Forked from mlc-ai/llm-perf-bench

Shell Updated Aug 5, 2023
utils Public
Forked from mlc-ai/package

Python Apache License 2.0 Updated Jul 20, 2023
relax Public
Forked from mlc-ai/relax

Python 1 Apache License 2.0 Updated Jun 10, 2023
relax-sparse Public
Forked from tlc-pack/relax

Temp repo for prototyping relax(relay next), the effort will be upstreamed. We use the wiki pages on this repo to host design docs.

Python Apache License 2.0 Updated Jun 10, 2023
tlcpack Public
Forked from tlc-pack/tlcpack

Groovy Apache License 2.0 Updated Jun 7, 2023
web-llm Public
Forked from mlc-ai/web-llm

Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.

Python Apache License 2.0 Updated Apr 21, 2023
tvm-rfcs Public
Forked from apache/tvm-rfcs

A home for the final text of all TVM RFCs.

Apache License 2.0 Updated Apr 19, 2023
smoothquant Public
Forked from mit-han-lab/smoothquant

SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Python MIT License Updated Apr 12, 2023
dgsparse Public
Forked from dgSPARSE/dgSPARSE-Lib

Cuda 1 Apache License 2.0 Updated Apr 1, 2023
web-stable-diffusion Public
Forked from mlc-ai/web-stable-diffusion

Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.

Jupyter Notebook Apache License 2.0 Updated Mar 17, 2023
bibfetch Public

Fetch bibtex entries from academic search engines like dblp.

python bibtex vscode-extension

Python 3 GNU General Public License v3.0 Updated Feb 26, 2023

Previous Next

Footer

© 2025 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.