I have a package that uses Distributed and creates a different FFT plan for each process. When I build the package with more than one thread, creating the FFT plans becomes incredibly slow (on the order of 12 hours for something that should take under 1 s), regardless of how many threads I actually use in that Julia process. For debugging purposes I've made a (mostly) minimal example here. I tried to make an example that works in just the REPL, but building the package appears to be an important part of the problem. For now I'm planning to specify in the build that only one thread should be used, but I'm not sure how this would interact with ClusterManagers, where having both threaded FFTs and multiple processes would definitely be useful.
Steps to reproduce:
Set JULIA_NUM_THREADS=4 in the environment before starting Julia (the exact command depends on your shell and OS), then run:
using Pkg
Pkg.add(PackageSpec(url="https://github.com/dsweber2/minimalFreezingExample.jl.git"))
Pkg.build("minimalFreezingExample")
Pkg.test("minimalFreezingExample")
It should hang at i=2, j=1.
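For anyone unsure how to set the thread count in the first step, here is a minimal sketch assuming a POSIX-style shell such as bash or zsh. The check at the end is only there to confirm that Julia picked the value up, and it is skipped if `julia` is not on the PATH.

```shell
# Set the number of Julia threads for every Julia process started from this shell.
# (Assumption: bash/zsh; on Windows cmd.exe the equivalent is `set JULIA_NUM_THREADS=4`.)
export JULIA_NUM_THREADS=4

# Optional sanity check: print the thread count Julia actually sees.
if command -v julia >/dev/null 2>&1; then
    julia -e 'println(Threads.nthreads())'
fi
```

The variable has to be exported before Julia starts, because it is read at startup. The build and test steps above should then be run from a Julia session launched in the same shell.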
This is definitely related to https://github.com/JuliaMath/FFTW.jl/issues/121