Skip to content
View zzqiuzz's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Shanhai
  • 02:57 (UTC +08:00)

Block or report zzqiuzz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

FlashMLA: Efficient MLA decoding kernels

C++ 11,093 756 Updated Mar 1, 2025

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

C 7,047 1,945 Updated Mar 4, 2025

A framework for few-shot evaluation of language models.

Python 8,102 2,165 Updated Mar 4, 2025

A unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deploym…

Python 759 55 Updated Mar 3, 2025

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,269 835 Updated Jun 10, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 40,198 6,021 Updated Mar 4, 2025

Fast and memory-efficient exact attention

Python 16,074 1,522 Updated Mar 4, 2025

《Machine Learning Systems: Design and Implementation》- Chinese Version

TeX 4,286 449 Updated Apr 13, 2024

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 14,870 1,719 Updated Mar 2, 2025

Model Quantization Benchmark

Python 789 142 Updated Jan 20, 2025

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 140,589 28,187 Updated Mar 4, 2025

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.

LLVM 31,177 12,868 Updated Mar 4, 2025

收集整理 GitHub 上高质量、有趣的开源项目。

15,671 1,751 Updated Feb 7, 2025

Making large AI models cheaper, faster and more accessible

Python 40,529 4,478 Updated Mar 4, 2025

LLM inference in C/C++

C++ 75,802 10,958 Updated Mar 4, 2025

基于Pytorch的OCR工具库,支持常用的文字检测和识别算法

Python 1,430 310 Updated Sep 2, 2024

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…

Python 46,953 8,035 Updated Mar 4, 2025

Unofficial implementation of LSQ-Net, a neural network quantization framework

Python 288 41 Updated May 8, 2024

A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (p…

2,003 217 Updated Mar 4, 2025

PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.

Python 1,639 245 Updated Mar 28, 2024

OpenMMLab Detection Toolbox and Benchmark

Python 30,428 9,570 Updated Aug 21, 2024

The official PyTorch implementation of the ICLR2022 paper, QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quantization

Python 115 17 Updated Jul 11, 2023

Python 开源项目之「自学编程之路」,保姆级教程:AI实验室、宝藏视频、数据结构、学习指南、机器学习实战、深度学习实战、网络爬虫、大厂面经、程序人生、资源分享。

Python 9,925 1,627 Updated Nov 26, 2024

A simple toolkit for detecting and cropping main body from pictures. Support face and saliency detection.

Python 44 6 Updated Oct 12, 2021

Pytorch code for Hybrid Coarse-fine Classification for Head Pose Estimation

Python 98 22 Updated Sep 22, 2020

ICCV2021/2019/2017 论文/代码/解读/直播合集,极市团队整理

2,298 1,408 Updated Sep 19, 2023

Open-source Windows and Office activator featuring HWID, Ohook, TSforge, KMS38, and Online KMS activation methods, along with advanced troubleshooting.

Batchfile 123,473 12,017 Updated Feb 23, 2025

Simple Flickr Image Scraper

Python 220 64 Updated Feb 4, 2025

在 oxford hand 数据集上对 YOLOv3 做模型剪枝(network slimming)

Python 1,672 434 Updated Sep 26, 2022
Next