Missed optimization in math expression: max(min(a,b),max(a,b)) == max(a,b)

zamazan4ik commented 6 years ago


Bugzilla Link	35607
Version	trunk
OS	All
Blocks	llvm/llvm-project#34959
CC	@hfinkel,@RKSimon,@rotateright
Fixed by commit(s)	r369386

Extended Description

clang(trunk) with '--std=c++17 -O3 -march=native -ffast-math' flags for this code:

#include <algorithm>

int test(int a, int b) {
    return std::max(std::min(a,b), std::max(a,b));
}

generates this assembly:

test(int, int): # @test(int, int)
  cmp esi, edi
  mov eax, edi
  cmovle eax, esi
  cmp edi, esi
  cmovl edi, esi
  cmp eax, edi
  cmovge edi, eax
  mov eax, edi
  ret

gcc(trunk) with '--std=c++17 -O3 -march=native -ffast-math':

test(int, int):
        cmp     edi, esi
        mov     eax, edi
        cmovl   eax, esi
        ret

Helpful link: https://github.com/gcc-mirror/gcc/blob/07b69d3f1cd3dd8ebb0af1fbff95914daee477d2/gcc/match.pd
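
For reference, the identity behind the requested fold can be checked directly. The sketch below is only an illustration, not part of any LLVM or GCC patch; the names nested, simplified and identity_holds_on_range are hypothetical helpers introduced here. It compares the expression from the report against plain std::max(a, b) over a small integer range and at the int extremes.

```cpp
#include <algorithm>
#include <cassert>
#include <climits>

// The expression from the report.
constexpr int nested(int a, int b) {
    return std::max(std::min(a, b), std::max(a, b));
}

// The simplified form the optimizer is expected to produce.
constexpr int simplified(int a, int b) {
    return std::max(a, b);
}

// Hypothetical helper: compares both forms for every pair in [lo, hi].
// Keep hi well below INT_MAX so the loop counters cannot overflow.
bool identity_holds_on_range(int lo, int hi) {
    for (int a = lo; a <= hi; ++a)
        for (int b = lo; b <= hi; ++b)
            if (nested(a, b) != simplified(a, b))
                return false;
    return true;
}

// Spot checks at the extremes, evaluated at compile time.
static_assert(nested(INT_MIN, INT_MAX) == simplified(INT_MIN, INT_MAX),
              "identity must hold at the extremes");
static_assert(nested(INT_MAX, INT_MIN) == simplified(INT_MAX, INT_MIN),
              "identity must hold with operands swapped");

// Runtime check over a small range.
void check_small_range() {
    assert(identity_holds_on_range(-64, 64));
}
```

Since min(a,b) is never greater than max(a,b), the outer max always selects the inner max, which is why a single cmp/cmov pair is sufficient.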

RKSimon commented 2 years ago

mentioned in issue llvm/llvm-project#34959

rotateright commented 4 years ago

> Assuming it sticks, the next step would be to fix SimplifyCFG to propagate FMF from phi to select.

That part is at least partly done: https://reviews.llvm.org/rGebf9bf2cbc8f

But this example is harder than I imagined: we have to propagate FMF through memory ops and/or function parameters because the min/max calls take references (pointers) as arguments. That means we don't start with a phi of FP values; it's a phi of pointers to FP values.
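
To illustrate the point about references: std::min and std::max take their arguments by const reference and return a const reference, so the choice is made between two addresses and the value is loaded afterwards. The sketch below uses hypothetical stand-ins (max_by_ref, max_by_value) rather than the library's actual source, and it says nothing about the exact IR clang emits; it only shows the shape of the problem.

```cpp
#include <cassert>

// Same shape as std::max: the result is a reference to one of the arguments,
// so the selection happens on addresses, not on FP values.
template <class T>
const T& max_by_ref(const T& a, const T& b) {
    return (a < b) ? b : a;
}

// By-value variant for comparison: here the selection is on FP values directly.
inline float max_by_value(float a, float b) {
    return (a < b) ? b : a;
}

// Hypothetical check: the by-reference form hands back the address of an argument.
bool returns_address_of_argument() {
    float x = 1.0f, y = 2.0f;
    return &max_by_ref(x, y) == &y && &max_by_ref(y, x) == &y;
}
```

In the by-value form any fast-math flags would attach to a select of floats; in the by-reference form they would have to survive the load that follows the pointer selection.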

rotateright commented 4 years ago

Update - we have FMF on phi with: https://reviews.llvm.org/D67564

...but there was feedback that this may have unintended consequences, so posted for discussion on llvm-dev: http://lists.llvm.org/pipermail/llvm-dev/2019-September/135444.html

No responses so far, but as suggested, I'm waiting to build on that until people have had plenty of time to look at it.

Assuming it sticks, the next step would be to fix SimplifyCFG to propagate FMF from phi to select.

RKSimon commented 5 years ago

Current Codegen: https://godbolt.org/z/kVBRNY

rotateright commented 5 years ago

All integer min/max patterns should be optimized after: https://reviews.llvm.org/rL369386

The FP patterns will have to check fast-math-flags (FMF) to handle NaN and -0.0 properly, so we need to make sure that our FMF propagation is working as expected. In particular, we may need to extend FMF to phi nodes of FP values, so they get applied to a 'select' when we run -simplifycfg.
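
As a small illustration of why NaN and -0.0 matter for FP min/max reasoning in general: std::max(a, b) is specified as (a < b) ? b : a, so whenever the comparison is false it returns its first argument. That makes the result depend on operand order for NaN and for signed zeros, which is not the case for integers. The function names below are hypothetical, and the sketch assumes it is compiled without -ffast-math (with that flag the compiler may assume NaN never occurs and the checks lose their meaning).

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <limits>

// With a NaN operand the comparison is always false, so std::max returns
// whichever argument was passed first.
bool max_is_order_sensitive_for_nan() {
    const float nan = std::numeric_limits<float>::quiet_NaN();
    const float one = 1.0f;
    return std::isnan(std::max(nan, one)) && std::max(one, nan) == 1.0f;
}

// -0.0 and +0.0 compare equal, so again the first argument wins and the
// sign of the result depends on operand order.
bool max_is_order_sensitive_for_signed_zero() {
    const float neg = -0.0f, pos = 0.0f;
    return std::signbit(std::max(neg, pos)) && !std::signbit(std::max(pos, neg));
}

void run_checks() {
    assert(max_is_order_sensitive_for_nan());
    assert(max_is_order_sensitive_for_signed_zero());
}
```

A fold that reorders or merges FP min/max operands is therefore only safe when flags such as nnan and nsz are known to apply to the instructions involved.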

zamazan4ik commented 6 years ago

See this example:

#include <algorithm>

int test(float a, float b)
{
    return std::max(std::min(a,b), std::max(a,b));
}

Here I changed the variable types to float, and the optimization is missed as well. clang trunk with '-O3 -ffast-math' generates this:

test(float, float):                              # @test(float, float)
        movaps  xmm2, xmm1
        minss   xmm2, xmm0
        maxss   xmm1, xmm0
        maxss   xmm1, xmm2
        cvttss2si       eax, xmm1
        ret