Starred repositories
Official implementation for LaCo (EMNLP 2024 Findings)
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
[ICLR 2022] "Unified Vision Transformer Compression" by Shixing Yu*, Tianlong Chen*, Jiayi Shen, Huan Yuan, Jianchao Tan, Sen Yang, Ji Liu, Zhangyang Wang
Export utility for unconstrained channel pruned models
A method to increase the speed and lower the memory footprint of existing vision transformers.
TinyFusion: Diffusion Transformers Learned Shallow
A family of compressed models obtained via pruning and knowledge distillation
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"
Official repository for the AAAI 2025 paper (Can We Get Rid of Handcrafted Feature Extractors? SparseViT: Nonsemantics-Centered, Parameter-Efficient Image Manipulation Localization through Spare-Co…)
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".
An official implementation of the paper "How Sparse Can We Prune A Deep Network: A Fundamental Limit Viewpoint".
[NeurIPS 2024] AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
The official code of the paper "PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction".
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models
[ECCV 2024 Oral] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models
[NeurIPS 2024] SlimSAM: 0.1% Data Makes Segment Anything Slim
[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Supports Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.
[ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan Fu, Haichuan Yang, Jiayi Yuan, Meng Li, Cheng Wan, Raghurama…
Neural Network Compression Framework for enhanced OpenVINO™ inference
A collection of pre-trained, state-of-the-art models in the ONNX format
Official Pytorch Implementation of "Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity"
Prospect Pruning: Finding Trainable Weights at Initialization Using Meta-Gradients