Closed: gsauthof closed this issue 3 years ago
Well spotted! It's on purpose: for N-bit division, libdivide only tries N- and (N+1)-bit magic numbers, skipping N-1, N-2, etc. This makes the magic-number computation faster (no loops) and doesn't hurt division performance: an N-bit multiplication takes the same number of clock cycles regardless of how large the operands are.
gcc generates the smallest magic number by looping over possible shifts. The only real benefit here is code size: if you generate a smaller literal it may take up less space in the instruction stream. This makes sense for an offline compiler like gcc, not for libdivide.
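For illustration, here is a minimal Python sketch (not libdivide's or gcc's actual code) of the gcc-style search described above: loop over shifts from smallest upward and take the first magic/shift pair that reproduces unsigned division exactly for every N-bit input. It assumes unsigned division and uses a tiny 8-bit width so the correctness check can be exhaustive.

```python
def smallest_magic(d, bits=8):
    """gcc-style search (sketch): try shifts from small to large and
    return the first (magic, shift) pair such that
    (n * magic) >> (bits + shift) == n // d for every bits-wide
    unsigned n.  Checked by brute force over all 2**bits inputs."""
    for shift in range(2 * bits + 1):
        # Candidate magic number: ceil(2**(bits+shift) / d)
        magic = ((1 << (bits + shift)) + d - 1) // d
        if all((n * magic) >> (bits + shift) == n // d
               for n in range(1 << bits)):
            return magic, shift
    raise ValueError("no exact magic/shift pair found")

# For d = 7 the smallest workable magic is still (N+1) = 9 bits wide,
# so skipping the shift loop costs libdivide nothing at runtime; the
# smaller literal gcc may find for other divisors only saves code size.
magic, shift = smallest_magic(7)
print(magic, shift, magic.bit_length())
```

The key point the sketch makes concrete: the loop over shifts buys a smaller literal at best, never a faster multiply, which is why an offline compiler is the right place for it.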
Closing this; computing a smaller magic number has no real benefit for libdivide.
Probably doesn't make much of a difference for most uses - and I don't know if it's a libdivide goal to minimize returned magic/shift values.
I noticed that libdivide sometimes returns larger magic/shift values than what gcc comes up with (when using a static divisor).
Example: