Skip to content
View LinkZyy's full-sized avatar
🎯
Focusing
🎯
Focusing
  • UCAS
  • BeiJing
  • 19:11 (UTC +08:00)

Highlights

  • Pro

Block or report LinkZyy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 6,336 465 Updated Dec 19, 2024

libdrm_amdgpu bindings for Rust, and some methods ported from Mesa3D

Rust 14 1 Updated Dec 15, 2024

ROCm Platform Runtime: ROCr a HPC market enhanced HSA based runtime

C++ 229 111 Updated Dec 20, 2024

Documentation of NVIDIA chip/hardware interfaces

C 1,251 92 Updated Sep 10, 2024

AMDGPU Driver with KFD used by the ROCm project. Also contains the current Linux Kernel that matches this base driver

C 338 101 Updated Dec 3, 2024

A GPU-accelerated DNN inference serving system that supports instant kernel preemption and biased concurrent execution in GPU scheduling.

C 40 7 Updated May 29, 2022
C++ 110 51 Updated Dec 19, 2024

Let us control diffusion models!

Python 30,980 2,779 Updated Feb 25, 2024

ASCII generator (image to text, image to image, video to video)

Python 7,467 573 Updated Nov 22, 2024

Inference script for Oasis 500M

Python 1,652 141 Updated Nov 8, 2024

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 1,022 102 Updated Dec 20, 2024

FlashInfer: Kernel Library for LLM Serving

Cuda 1,587 160 Updated Dec 20, 2024

Dynamic Memory Management for Serving LLMs without PagedAttention

C 261 16 Updated Dec 6, 2024

DietCode Code Release

Cuda 61 9 Updated Jul 21, 2022

Sniff CUDA ioctls

C 179 25 Updated May 4, 2023

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 1,838 222 Updated Dec 13, 2024

open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality

Python 167 14 Updated Aug 2, 2024

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

JavaScript 12,745 44,540 Updated Dec 18, 2024

Grok open release

Python 49,738 8,342 Updated Aug 30, 2024

Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training

C++ 1,738 234 Updated Dec 20, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 22,694 2,226 Updated Dec 20, 2024

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 96,465 15,690 Updated Dec 20, 2024

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 6,572 590 Updated May 31, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 6,591 585 Updated Dec 19, 2024
Jupyter Notebook 14 1 Updated Jan 24, 2024

Development repository for the Triton language and compiler

C++ 13,736 1,684 Updated Dec 20, 2024

《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。

Python 64,483 11,117 Updated Jul 30, 2024

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Python 9,809 712 Updated Dec 4, 2024

Simple Directmedia Layer

C 10,543 1,894 Updated Dec 20, 2024
Python 267 25 Updated Dec 20, 2023
Next