If we take:
```c
float f(int x[]) {
    float p = 1;
    for (int i = 0; i < 960; i++) p += 1;
    return p;
}
```
and compile simply with `-O` and no other flags, we get:
```asm
.LCPI0_0:
        .long   1148207104              # float 961
f:                                      # @f
        movss   xmm0, dword ptr [rip + .LCPI0_0] # xmm0 = mem[0],zero,zero,zero
        ret
```
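(For reference, `1148207104` is `0x44704000`, the IEEE-754 single-precision encoding of `961.0`, so the loop has already been folded to a constant load here. The Intel-syntax listing suggests output along the lines of `clang -O -S -masm=intel`, though the exact invocation is my assumption, not stated in the report.)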
Extended Description
Consider:
```c
float f(float x[]) {
    float p = 1.0;
    for (int i = 0; i < 960; i++) p += 1;
    return p;
}
```
When compiled with `-march=core-avx2 -O3 -ffast-math`, the generated assembly still contains the loop, adding 1 each iteration until it reaches 961.
However:
```c
int f(int x[]) {
    int p = 1;
    for (int i = 0; i < 960; i++) p += 1;
    return p;
}
```
gives:
```asm
f:                              # @f
        mov     eax, 961
        ret
```
I don't know how hard it would be to add the same optimization for floats and doubles.
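For what it's worth, the fold is exact here even without `-ffast-math`: every intermediate sum is an integer no larger than 961, well below 2^24, so each `p += 1` rounds to the exact result. A minimal sketch of the closed form the optimizer could emit (the name `f_folded` is mine, not from the report):

```c
/* Hypothetical closed form for the float loop above. Since every
   intermediate value is an integer <= 961 (< 2^24), 960 repetitions
   of p += 1 starting from 1.0f are all exact, and the result is
   bit-identical to running the loop. */
float f_folded(float x[]) {
    (void)x;           /* the array parameter is unused, as in the original */
    return 961.0f;     /* 1.0f + 960 * 1.0f */
}
```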
As a side note, there are in fact a number of interesting details in the first (float) loop. First, if we reduce the `i < 960` limit to `i < 959`, the loop is optimized out. Second, if we change the type to `double`, this upper limit drops to `i < 479`. My guess is that this corresponds to a loop-unrolling cost model built into the compiler.
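To pin down where that threshold sits, one could parameterize the trip count and diff the resulting assembly; a small probe along these lines (the `N` macro is my addition for illustration):

```c
/* Hypothetical threshold probe: rebuild with different -DN values and
   check whether the emitted assembly contains a loop or a constant. */
#ifndef N
#define N 959   /* per the report: folded at i < 959 for float, not at i < 960 */
#endif

float f(float x[]) {
    float p = 1.0f;
    for (int i = 0; i < N; i++)
        p += 1;
    return p;
}
```

For example, compare `clang -march=core-avx2 -O3 -ffast-math -S -DN=959 probe.c` against `-DN=960`, and `-DN=479`/`-DN=480` after switching `float` to `double`.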