How to Quantize input and output? #2

Open

JochiSt opened this issue Feb 28, 2025 · 1 comment

JochiSt commented Feb 28, 2025

In my application, I have a data stream with a defined bit format. This stream should be the input to the neural network, and the network's output should also have a defined bitwidth so that it can feed the next stage (which also already exists).

What I have got so far is that I can fix the input bitwidth by using a signature layer, but I have no clue how to fix the output's bitwidth. I am wondering whether I have overlooked this part in the documentation.
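
For reference, this is roughly what I have for the input side (a minimal sketch; the `Signature` argument order is my assumption from the HGQ docs and may differ in your version):

```python
from tensorflow.keras.models import Sequential
from HGQ.layers import Signature, HDense

# Declare the incoming datastream as already-quantized fixed-point values.
# Assumed signature: Signature(keep_negative, bits, int_bits), i.e. a signed
# 16-bit input with 6 integer bits here -- check the exact API in your version.
model = Sequential([
    Signature(1, 16, 6),
    HDense(32, beta=3e-6, activation='relu'),
    HDense(10, beta=3e-6),
])
```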

calad0i (Owner) commented Mar 2, 2025

Sorry for the late reply. GitHub signed me out for some reason...

Fixing the output size of the datastream can only be partially done: the fractional part can be fixed by setting the corresponding bitwidth weight to be non-trainable, but the integer part will still be adaptive during training.
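
In code, something like the following (a hypothetical sketch: the attribute name `paq` is my shorthand for the pre-activation quantizer that holds the trainable fractional-bitwidth weight; inspect the source of your installed version for the actual names):

```python
# Hypothetical: freeze the last layer's pre-activation quantizer so its
# fractional bitwidth (a trainable weight) stops adapting during training.
# `paq` is an assumed attribute name and may differ between versions.
out_layer = model.layers[-1]
out_layer.paq.trainable = False

# Keras only picks up trainability changes at compile time:
model.compile(optimizer='adam', loss='mse')
```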

While I would not expect this to be a significant issue during training, some manual effort will be required to fix the output size of the output pipe (e.g., overriding result_t in the converted hls4ml model).
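
For the stock hls4ml config route, the usual pattern looks like this (a sketch; the layer name, FPGA part, and precision string are placeholders, and the exact flow may differ when going through this library's converter):

```python
import hls4ml

# Build a per-layer config and pin the result type of the final layer so
# the output bitwidth of the generated firmware is fixed.
config = hls4ml.utils.config_from_keras_model(model, granularity='name')
config['LayerName']['dense_out']['Precision']['result'] = 'ap_fixed<16,6>'  # placeholder layer name

hls_model = hls4ml.converters.convert_from_keras_model(
    model,
    hls_config=config,
    output_dir='hls_prj',          # placeholder
    part='xcu250-figd2104-2L-e',   # placeholder FPGA part
)
```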

In the next release of this library, finer control will be possible, including fixing the quantizer sizes.

P.S. The result_t naming may change in the near future.
