-
BUPT & Xidian
- Beijing
Stars
The simplest, fastest repository for training/finetuning medium-sized GPTs.
[ECCV2024] Overcoming Distribution Mismatch in Quantizing Image Super-Resolution Networks
List of papers related to neural network quantization in recent AI conferences and journals.
Official implementation for paper LIMPQ, "Mixed-Precision Neural Network Quantization via Learned Layer-wise Importance", ECCV 2022
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, s…
校招、秋招、春招、实习好项目!带你从零实现一个高性能的深度学习推理库,支持大模型 llama2 、Unet、Yolov5、Resnet等模型的推理。Implement a high-performance deep learning inference library step by step
[CVPR'21] Projecting Your View Attentively: Monocular Road Scene Layout Estimation via Cross-view Transformation
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
Skeleton-based action recognition models in PyTorch, including Two-Stream CNN, HCN, HCN-Baseline, Ta-CNN and Dynamic GCN
[CVPR 2022] AlignQ: Alignment Quantization with ADMM-based Correlation Preservation
A Out-of-box PyTorch Scaffold for Neural Network Quantization-Aware-Training (QAT) Research. Website: https://github.com/zhutmost/neuralzip
A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (p…
Code of paper Exploring Task Difficulty for Few-Shot Relation Extraction. https://arxiv.org/abs/2109.05473
(TPAMI2022) The ImageNet-S benchmark/method for large-scale unsupervised/semi-supervised semantic segmentation.
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
⏰ Collaboratively track deadlines of conferences recommended by CCF (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~
Neural Network Compression Framework for enhanced OpenVINO™ inference
PyTorch implementation of [1412.6553] and [1511.06530] tensor decomposition methods for convolutional layers.
An official implementation of "Network Quantization with Element-wise Gradient Scaling" (CVPR 2021) in PyTorch.
An official PyTorch implementation of the paper "Distance-aware Quantization", ICCV 2021.
The PyTorch implementation of Learned Step size Quantization (LSQ) in ICLR2020 (unofficial)