Open Quuxplusone opened 11 years ago
Attached neon-floating-point-precision.ll
(2738 bytes, application/octet-stream): Test file attached
This may be a way to fix the fast-math issue in PR16275.
After a long discussion on the list, this approach can be very problematic for NEON intrinsics, which require that NEON instructions be generated regardless of IEEE compliance or fast-math flags. Since IR doesn't differentiate between code produced by the vectorizers and code produced by NEON intrinsics, we can't apply any serialization rule indiscriminately.
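To illustrate the ambiguity, here is a minimal sketch. The function names are hypothetical, and it assumes a simple arithmetic intrinsic such as vaddq_f32 is lowered by the front end to a generic vector operation rather than to a target-specific call. Under that assumption, the two sources end up as the same IR, and nothing marks which one must stay on NEON:

```llvm
; Hypothetical example: body as it might look when the user wrote a NEON
; add intrinsic by hand (assumed to be lowered to a generic vector fadd).
define <4 x float> @from_neon_intrinsic(<4 x float> %a, <4 x float> %b) {
  %sum = fadd <4 x float> %a, %b
  ret <4 x float> %sum
}

; Hypothetical example: body as it might look after a vectorizer has
; widened a scalar float addition to four lanes.
define <4 x float> @from_vectorizer(<4 x float> %a, <4 x float> %b) {
  %sum = fadd <4 x float> %a, %b
  ret <4 x float> %sum
}
```

A rule that serializes every `fadd <4 x float>` when fast-math is off would scalarize both functions, including the one where the user explicitly asked for a NEON instruction.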
The only option left would be an extra command-line option requesting IEEE compliance; it would then be the user's responsibility to check for NEON intrinsics, hand-crafted IR, etc.
This is also too big a hammer to fix #16275, which already has its own fix.
All in all interesting, but too low on the priority list for me to work on it.