During pruning, we set the scaling factors in the BN layers that correspond to pruned channels to 0.
When training the pruned model, in each iteration, before calling `optimizer.step()`, we also set the gradients of those zeroed scaling factors to 0, so that they stay at 0 after the update. This is done in the `BN_grad_zero` function.
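A minimal sketch of what such a function can look like (assuming a PyTorch model with `nn.BatchNorm2d` layers; this follows the description above rather than reproducing the repository's exact code):

```python
import torch.nn as nn

def BN_grad_zero(model):
    # Zero the gradients of the BN scaling factors (and biases) that were
    # masked to 0 during pruning, so optimizer.step() leaves them at 0.
    for m in model.modules():
        if isinstance(m, nn.BatchNorm2d):
            mask = (m.weight.data != 0).float()  # 1 = kept channel, 0 = pruned
            m.weight.grad.data.mul_(mask)
            m.bias.grad.data.mul_(mask)

# Typical call site inside the training loop:
#   loss.backward()
#   BN_grad_zero(model)
#   optimizer.step()
```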
- We don't need to introduce a channel selection layer, which would add to the training time.
- Even if a layer is pruned to zero channels, no error is raised; the layer simply outputs an all-zero tensor.
- It is not straightforward to compute FLOPs and parameter counts, since pruned channels are masked rather than removed (see the sketch after this list).
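Because pruned channels are only masked (their scaling factors are zero) rather than physically removed, the model's nominal size is unchanged; a rough starting point is to count the surviving channels from the BN scaling factors and map them back to each convolution. The helper below is a hypothetical sketch, not part of the repository:

```python
import torch.nn as nn

def count_remaining_channels(model):
    # Hypothetical helper: number of unmasked (nonzero) BN scaling factors
    # per BatchNorm layer, i.e. the channels that survive pruning.
    return [int((m.weight.data != 0).sum().item())
            for m in model.modules()
            if isinstance(m, nn.BatchNorm2d)]
```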
Baseline training:
```shell
python main_mask.py --dataset cifar100 --arch resnet --depth 164
```
Training with sparsity:
```shell
python main_mask.py --dataset cifar100 --arch resnet --depth 164 -sr --s 0.00001
```
Pruning:
```shell
python prune_mask.py --dataset cifar100 --arch resnet --depth 164 --percent 0.4 --model [PATH TO THE MODEL] --save [DIRECTORY TO STORE RESULT]
```
Fine-tuning:
```shell
python main_mask.py --dataset cifar100 --arch resnet --depth 164 --refine [DIRECTORY TO THE PRUNED MODEL]
```
| CIFAR100-ResNet-164 | Baseline | Sparsity (1e-5) | Prune (40%) | Fine-tune-160 (40%) | Prune (60%) | Fine-tune-160 (60%) |
|---|---|---|---|---|---|---|
| Top-1 Accuracy (%) | 76.68 | 76.89 | 48.61 | 77.33 | 1.91 | 76.07 |