You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In pwlq.py, from lines 60 to 67, if the quant_bit of the middle area is set to bits, the quant_bits of tail_neg and tail_pos are set to bits-1 respectively.
You are right, I'm confused with it when I see the code for the first time. But you can look at the paper in 3.2, the last paragraph says, We emphasize that b-bit PWLQ represents FP32 values into b-bitintegers to support b-bit multiply-accumulate operations, even though in total, it has the same number of quantization levels as (b+1)-bit uniform quantization.
In
pwlq.py
, from lines 60 to 67, if the quant_bit of the middle area is set tobits
, the quant_bits of tail_neg and tail_pos are set tobits-1
respectively.Won't there be a total
num_levels
of the full range become2 * 2 ** bit
? Is this right? Or is there anything wrong with my understanding?The text was updated successfully, but these errors were encountered: