AXAIIT

Starred repositories

Official implementation for LaCo (EMNLP 2024 Findings)

Jupyter Notebook 12 3 Updated Oct 3, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 40,245 4,932 Updated Feb 12, 2025

[ICLR 2022] "Unified Vision Transformer Compression" by Shixing Yu*, Tianlong Chen*, Jiayi Shen, Huan Yuan, Jianchao Tan, Sen Yang, Ji Liu, Zhangyang Wang

Python 52 3 Updated Dec 1, 2023

Export utility for unconstrained channel pruned models

Jupyter Notebook 71 11 Updated Jul 14, 2023

A method to increase the speed and lower the memory footprint of existing vision transformers.

Python 1,007 71 Updated Jun 17, 2024

TinyFusion: Diffusion Transformers Learned Shallow

Python 73 1 Updated Dec 4, 2024

A family of compressed models obtained via pruning and knowledge distillation

320 18 Updated Nov 13, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 39,262 6,409 Updated Dec 9, 2024

Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"

Jupyter Notebook 952 43 Updated Aug 12, 2024

Official repository for the AAAI 2025 paper (Can We Get Rid of Handcrafted Feature Extractors? SparseViT: Nonsemantics-Centered, Parameter-Efficient Image Manipulation Localization through Spare-Co…

Python 25 1 Updated Jan 17, 2025

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything

Jupyter Notebook 2,252 150 Updated Dec 24, 2024

[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

Python 675 43 Updated Aug 13, 2024

Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".

Python 763 99 Updated Aug 20, 2024

An official implementation of the paper "How Sparse Can We Prune A Deep Network: A Fundamental Limit Viewpoint".

Python 28 2 Updated Nov 13, 2024

[NeurIPS 2024] AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models

Python 20 1 Updated Dec 28, 2024

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

Python 2,022 164 Updated Mar 27, 2024

The official code of the paper "PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction".

Python 51 Updated Jan 8, 2025

Pruning the VLLMs

Python 80 4 Updated Dec 9, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,396 2,353 Updated Aug 12, 2024

LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models

Python 115 6 Updated May 15, 2024

[ECCV 2024 Oral] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models

Python 354 14 Updated Jan 4, 2025

[NeurIPS 2024] SlimSAM: 0.1% Data Makes Segment Anything Slim

Python 321 18 Updated Nov 17, 2024

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.

Python 946 111 Updated Oct 7, 2024

[ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan Fu, Haichuan Yang, Jiayi Yuan, Meng Li, Cheng Wan, Raghurama…

Python 71 4 Updated Jul 7, 2022

Neural Network Compression Framework for enhanced OpenVINO™ inference

Python 968 244 Updated Feb 13, 2025

A collection of pre-trained, state-of-the-art models in the ONNX format

Jupyter Notebook 8,241 1,429 Updated Apr 30, 2024

Official Pytorch Implementation of "Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity"

Python 57 8 Updated Jun 26, 2024
Python 2 1 Updated Nov 1, 2024

Prospect Pruning: Finding Trainable Weights at Initialization Using Meta-Gradients

Python 31 1 Updated Mar 30, 2022