JuliaPerf / BenchmarksGame.jl

Other
43 stars 13 forks source link

make fannkuchredux-fast slighly faster #13

Closed hycakir closed 5 years ago

hycakir commented 5 years ago

This should (hopefully) run slightly faster than the latest Julia and Java implementation. This makes count_flips (the bottleneck) slightly faster. The parallelization is now done the same way as in Jeremy Zerfas' C implementation, changing block sizes, removing atomic operations and using reduction after threads join.

hycakir commented 5 years ago

Unfortunately. I forgot removing @time from the last version, amended the last commit on my branch and then forced push. I think GitHub pull requests no longer support this. Should I create new PR?