Open ESI-SYD opened 6 months ago
The Intel math library doesn't contain the API named __imf_sqrtf_rn
, therefore we cannot at the moment lower the Triton PreciseSqrtOp
to that library function and we lower it instead to __imf_sqrtf
which does not yield a result with the higher precision that is required.
tl.math.sqrt_rn(x)
runs incorrectly on XPU, seems there is no precisesqrt_rn
implemention in libsycl-spir64-unknown-unknown.bc. RemoveRun