Closed cmdevries closed 9 years ago
Also compare performance before and after.
This can just be done by switching the ACCUMULATOR vector type to an atomic version. Note that whole vector operations do NOT need to be atomic, just updating a single dimension in the accumulator vector.
This turned out to be 6x slower on Clang / MacOS.
Also compare performance before and after.
This can just be done by switching the ACCUMULATOR vector type to an atomic version. Note that whole vector operations do NOT need to be atomic, just updating a single dimension in the accumulator vector.