[models] Add model compression utils #5
Comments
ONNX conversion seems to be incompatible with TF 2.4.* as per onnx/keras-onnx#662. I tried on my end and encountered the same problem.
A good lead for ONNX support would be to use https://github.com/onnx/tensorflow-onnx (we might have to create a SavedModel to use it, but it's worth a look).
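For reference, a minimal sketch of that tf2onnx route, assuming a plain Keras model; the model, input signature, and output path below are placeholders rather than doctr code:

```python
import tensorflow as tf
import tf2onnx

# Placeholder model standing in for a doctr model
model = tf.keras.applications.MobileNetV2()

# Option 1: convert the in-memory Keras model directly
spec = (tf.TensorSpec((None, 224, 224, 3), tf.float32, name="input"),)
tf2onnx.convert.from_keras(model, input_signature=spec, opset=13, output_path="model.onnx")

# Option 2: go through a SavedModel first, then use the CLI:
#   model.save("saved_model")
#   python -m tf2onnx.convert --saved-model saved_model --output model.onnx
```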
@frgfm I think we can remove the TensorRT point if we support ONNX, wdyt?
Yes sure! We'll need to take a look at pruning at some point.
Yeah, pruning is fine, but TensorRT is a bit too much (the user should handle that on their own side; if we can provide ONNX export, this shouldn't be too tricky).
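If pruning ends up on the table, a rough sketch with tensorflow_model_optimization (an assumed dependency, not something doctr currently ships) could look like the following; the toy model and sparsity schedule are illustrative only:

```python
import tensorflow as tf
import tensorflow_model_optimization as tfmot

# Toy model standing in for a doctr architecture
model = tf.keras.Sequential([
    tf.keras.layers.Dense(128, activation="relu", input_shape=(64,)),
    tf.keras.layers.Dense(10),
])

# Prune 50% of the weights by magnitude over the first 1000 training steps
schedule = tfmot.sparsity.keras.PolynomialDecay(
    initial_sparsity=0.0, final_sparsity=0.5, begin_step=0, end_step=1000
)
pruned = tfmot.sparsity.keras.prune_low_magnitude(model, pruning_schedule=schedule)

pruned.compile(
    optimizer="adam",
    loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True),
)
# Fine-tuning needs the pruning callback so the masks are updated each step:
# pruned.fit(x, y, callbacks=[tfmot.sparsity.keras.UpdatePruningStep()])
```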
Add a `doctr.models.utils` module to compress existing models and improve their latency / memory load for inference purposes on CPU. Some interesting leads to investigate:

- Optional: TensorRT export (cf. https://developer.nvidia.com/blog/speeding-up-deep-learning-inference-using-tensorflow-onnx-and-tensorrt/)
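For illustration, one possible shape for such a utility, assuming we rely on TFLite post-training dynamic-range quantization to reduce model size and improve CPU inference; the function name and its placement in `doctr.models.utils` are hypothetical, not an agreed API:

```python
import tensorflow as tf

def quantize_model(model: tf.keras.Model, output_path: str) -> None:
    """Hypothetical helper: export a Keras model as a dynamically-quantized
    TFLite flatbuffer to shrink its size and speed up CPU inference."""
    converter = tf.lite.TFLiteConverter.from_keras_model(model)
    converter.optimizations = [tf.lite.Optimize.DEFAULT]
    tflite_model = converter.convert()
    with open(output_path, "wb") as f:
        f.write(tflite_model)

# Usage with a placeholder model:
# quantize_model(tf.keras.applications.MobileNetV2(), "model_quant.tflite")
```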