TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, etc. It compresses deep learning models for downstream d…

Python 622 45 Updated Dec 4, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

binghanc

Block or report binghanc

Stars

huggingface / pytorch_block_sparse

stanford-futuredata / stk

hclhkbu / gcoospdm

ptillet / torch-blocksparse

openai / blocksparse

YulhwaKim / cutlass_tilesparse

NVIDIA / TensorRT-Model-Optimizer