Open KristofferC opened 3 years ago
I'll check your version's perf out, but in past versions of Julia, multithreading binary-trees was slower than non-parallel code much less the distributed version; something to do with how the allocator behaves with multi-threading afaicr.
It was faster for me with 4 threads at least.
Something like:
performs quite a bit better than the submitted benchmark since it avoids spawning a bunch of workers.