Multiple versions with different threading strategies are included because which one is fastest depends on the machine. Depending on the machine, gains can exceed 20% compared to the original mandelbrot-fast.jl.
I would appreciate input on the relative timings others see for each of the scripts on their own computers. For example, I get the following timings on my machines using hyperfine:
laptop with Intel(R) Core(TM) i5-5300U CPU @ 2.30GHz
desktop with Intel(R) Core(TM) i5-3470 CPU @ 3.20GHz
desktop with Intel(R) Core(TM) i5-6500 CPU @ 3.20GHz
NOTE: running mandelbrot-fast.v3.jl requires installing KissThreading.jl (https://github.com/mohamed82008/KissThreading.jl).
Changes included in every version:
Removed the @simd annotation from mandel_inner: SIMD is already occurring at the level of mand8, so @simd doesn't hurt runtime but increases compilation time.
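To illustrate why the outer annotation is redundant, here is a minimal sketch of an 8-lane kernel in the spirit of mand8. It is not the code from mandelbrot-fast.jl: the names `Lanes`, `step8` and `mand8_sketch`, the `maxiter` default, and the bit layout are all assumptions made for illustration. The idea is that the arithmetic on fixed-width tuples is what the compiler can typically vectorize, so the enclosing loop needs no `@simd`.

```julia
# Hypothetical 8-lane kernel; illustrative only, not copied from mandelbrot-fast.jl.
const Lanes = NTuple{8,Float64}

# One Mandelbrot step z -> z^2 + c, applied to all 8 lanes via tuple broadcasting.
@inline function step8(zr::Lanes, zi::Lanes, cr::Lanes, ci::Lanes)
    new_zr = zr .* zr .- zi .* zi .+ cr
    new_zi = 2.0 .* zr .* zi .+ ci
    return new_zr, new_zi
end

# Returns a byte whose bits (most significant bit = lane 1) are set for the
# lanes that are still bounded after `maxiter` steps.
function mand8_sketch(cr::Lanes, ci::Lanes; maxiter::Int = 50)
    zr = ntuple(_ -> 0.0, 8)
    zi = ntuple(_ -> 0.0, 8)
    for _ in 1:maxiter            # plain loop, no @simd annotation
        zr, zi = step8(zr, zi, cr, ci)
    end
    byte = 0x00
    for k in 1:8
        # Escaped lanes end up as Inf or NaN, so this comparison is false for them.
        inside = zr[k]^2 + zi[k]^2 <= 4.0
        byte = (byte << 1) | UInt8(inside)
    end
    return byte
end

# Lanes alternate between c = 0 (bounded) and c = 2 (escapes).
cr = (0.0, 2.0, 0.0, 2.0, 0.0, 2.0, 0.0, 2.0)
ci = ntuple(_ -> 0.0, 8)
println(mand8_sketch(cr, ci))     # prints 170 (0xaa)
```

Whether the tuple operations actually compile to vector instructions depends on the Julia version and the CPU, so it is worth checking with `@code_native` on the target machine.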