hpcgarage / spatter

Benchmark for measuring the performance of sparse and irregular memory access.
Other
72 stars 15 forks source link

Update OpenMP Kernels #211

Closed radelja closed 6 days ago

radelja commented 1 week ago

Overview

This PR updates the OpenMP kernels to address an issue with the gather kernel and aligns them closer to their v1.1 implementations. As mentioned in #189, there is still a gap in performance between the current scatter, multiscatter, and sg OpenMP kernels on certain platforms.

✨ Change Description/Rationale

πŸ‘€ Reviewer Checklist

βœ… PR Checklist

πŸš€ TODOs

πŸ“Œ Future Work

plavin commented 6 days ago

Looks good and runs fine on my machine. Once Jeff's comment about atomics is resolved that we can merge