Closed ghennequin closed 1 year ago
Apologies, I haven’t tried the code sample or looked at it deeply, but I noticed the operation uses floating point and with floating point operations generally cannot be reordered without potentially changing the results slightly due to limited floating point precision. Could that be the case here?
I'm having issues with
parallel_scan
which behaves oddly for a particular associative operator -- in particular, it doesn't return the same thing as afold_left
.parallel_scan
seems to be operating correctly on symmetric operators, e.g. sum. I'll try and look into the code forparallel_scan
to see if perhaps the arguments ofop
got swapped somewhere they shouldn't.