willcrichton / corrset-benchmark

A repository to test different performance optimizations in a sparse matrix computation.
https://willcrichton.net/notes/k-corrset/

More performance improvements #4

Open cjcormier opened 10 months ago

cjcormier commented 10 months ago

The following performance improvements are in this PR and one to your indexical repo:

On a large dataset with k = 5, I get the following improvements:

| Change | Runtime (s) | Total Reduction |
| --- | --- | --- |
| Base | 264.923 | 0% |
| Simd Iter Skip 0 | 202.173 | 24% |
| Scores | 193.053 | 27% |
| Manual Iter | 178.496 | 32% |

I have also tested another change that replaces the clone followed by `BitOrAssign` with a bitwise OR and a separate assignment. This removes the need for the clone and reduces the runtime by a further 10 s on my machine. It requires coordinated changes in this repo and the indexical one, so I left it out of the current PRs. Let me know if that sounds worth pursuing.
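To make the clone-removal idea concrete, here is a minimal sketch of the two patterns. It assumes a hypothetical `Bits` type backed by `Vec<u64>`. This is a stand-in for illustration only, not the actual indexical set type or its API, and the function names `union_clone` and `union_direct` are likewise made up.

```rust
use std::ops::{BitOr, BitOrAssign};

/// Hypothetical bitset used only for illustration; NOT the indexical type.
#[derive(Clone, Debug, PartialEq)]
pub struct Bits {
    words: Vec<u64>,
}

impl Bits {
    pub fn from_words(words: Vec<u64>) -> Self {
        Bits { words }
    }
}

/// In-place OR: `lhs |= &rhs`.
impl BitOrAssign<&Bits> for Bits {
    fn bitor_assign(&mut self, rhs: &Bits) {
        for (a, b) in self.words.iter_mut().zip(&rhs.words) {
            *a |= *b;
        }
    }
}

/// By-reference OR: `&lhs | &rhs` builds the result directly.
impl BitOr<&Bits> for &Bits {
    type Output = Bits;
    fn bitor(self, rhs: &Bits) -> Bits {
        Bits {
            words: self
                .words
                .iter()
                .zip(&rhs.words)
                .map(|(a, b)| a | b)
                .collect(),
        }
    }
}

/// "Before" pattern: copy `a`, then OR `b` into the copy (two passes over the output).
pub fn union_clone(a: &Bits, b: &Bits) -> Bits {
    let mut out = a.clone();
    out |= b;
    out
}

/// "After" pattern: compute `a | b` and assign it (one pass, no clone).
pub fn union_direct(a: &Bits, b: &Bits) -> Bits {
    let out = a | b;
    out
}
```

Both functions return the same set. The second one writes each output word once instead of copying it and then updating it, which is one plausible source of the saving described above. The actual change may differ in detail.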