Fix division-by-zero #123

jay-tux · 2024-11-22T16:46:01Z

When quantizing the (input) activations to the bit-linear layer, NaNs may occur due to division by zero. This is a consequence of the formula in the original paper:
$Quant(x) = Clip(x \times \frac{Q_b}{||x||_\infty}, -Q_b + \epsilon, Q_b - \epsilon)$

In the extreme case where all activations are zero, this will result in abs-max being zero, and thus a division by zero.
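
Spelled out for the all-zero case, assuming the kernels follow IEEE 754 floating-point semantics (an assumption on my part, not something the paper states):

$$x = 0 \;\Rightarrow\; ||x||_\infty = 0 \;\Rightarrow\; \frac{Q_b}{||x||_\infty} = +\infty \;\Rightarrow\; x \times \frac{Q_b}{||x||_\infty} = 0 \times \infty = \text{NaN}$$

Clipping does not help here, since a NaN compares false against both bounds and passes through unchanged.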

To fix this, I add 1e-10f to every abs-max value in the preset kernels. For any tensor with a non-zero abs-max, this changes the result negligibly or not at all; when the abs-max is zero, it prevents the NaNs.
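
As a rough illustration of the idea (not the actual kernel code from this PR), here is a minimal scalar sketch of abs-max quantization with the guard term. The function name `quantize_absmax` and the `guard` parameter are hypothetical, the clipping margin $\epsilon$ from the formula is left out for brevity, and IEEE 754 float behavior is assumed.

```cpp
#include <cassert>
#include <cmath>
#include <vector>

// Hypothetical helper for illustration only; the real preset kernels differ.
// Scales x by qb / (abs_max + guard) and clips the result to [-qb, qb].
// Passing guard = 0.0f reproduces the unguarded formula.
std::vector<float> quantize_absmax(const std::vector<float>& x, float qb, float guard) {
    assert(qb > 0.0f);

    // Per-tensor abs-max, i.e. the infinity norm of x.
    float abs_max = 0.0f;
    for (float v : x) {
        const float a = std::fabs(v);
        if (a > abs_max) abs_max = a;
    }

    // With guard == 0 and abs_max == 0 this is qb / 0 = +inf under IEEE 754.
    const float scale = qb / (abs_max + guard);

    std::vector<float> q;
    q.reserve(x.size());
    for (float v : x) {
        float s = v * scale;  // 0 * inf = NaN in the unguarded all-zero case
        // Manual clip; a NaN fails both comparisons and is passed through.
        if (s > qb) {
            s = qb;
        } else if (s < -qb) {
            s = -qb;
        }
        q.push_back(s);
    }
    return q;
}
```

Calling `quantize_absmax(zeros, 127.0f, 0.0f)` on an all-zero vector should yield NaNs, while the same call with `guard = 1e-10f` should yield zeros. For inputs whose abs-max is around 1, adding `1e-10f` is below float32 resolution, so the scale is unchanged.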

jay-tux · 2024-11-22T16:50:19Z

@microsoft-github-policy-service agree company="UGent"

Avoid zero-division

117aaa9
