Closed kerim371 closed 8 months ago
It seems to happen randomly because of BLAS multithreading when there is more than one Julia worker on the same node.
We should maybe disable it, but data time interpolation can sometimes be slow without multithreading.
Thank you! Disabling it with BLAS.set_num_threads(1) should probably work. I will try it.
I just tried to run FWI with preliminary settings in startup.jl:
@info "STARTUP SCRIPT: $(@__FILE__ )"
using LinearAlgebra
BLAS.set_num_threads(1)
ENV["DEVITO_LANGUAGE"]="openmp"
ENV["OMP_NUM_THREADS"]=length(Sys.cpu_info())
ENV["DEVITO_LOGGING"]="INFO"
@info "Number of BLAS threads: $(BLAS.get_num_threads())"
@info "DEVITO_LANGUAGE: $(ENV["DEVITO_LANGUAGE"])"
@info "OMP_NUM_THREADS: $(ENV["OMP_NUM_THREADS"])"
@info "DEVITO_LOGGING: $(ENV["DEVITO_LOGGING"])"
and this didn't help: I got the same segfault error at the 9th FWI iteration.
This won't help: JUDI sets the BLAS thread count in its init, so you need to set it to 1 after using JUDI
https://github.com/slimgroup/JUDI.jl/blob/83752f147e175608d98b2ffe2d4778d58281c1f1/src/JUDI.jl#L203
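A minimal sketch of the ordering described above, assuming two local workers in place of the SSH nodes used in this thread. The `using JUDI` line is left as a comment so the snippet runs without JUDI installed; in a real session it would be uncommented and kept before the `BLAS.set_num_threads(1)` call.

```julia
using Distributed

# Two local workers, for illustration only; the thread uses SSH workers instead.
addprocs(2)

@everywhere using LinearAlgebra
# @everywhere using JUDI   # load JUDI first: its init sets the BLAS thread count

# Set the BLAS thread count on every process *after* the package has loaded.
@everywhere BLAS.set_num_threads(1)

# Report the value each process actually ended up with.
for p in procs()
    n = remotecall_fetch(BLAS.get_num_threads, p)
    @info "process $p: BLAS threads = $n"
end
```

Checking the value on each process, rather than only on the master, shows whether the setting really reached every worker.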
Didn't know that! Thank you!
@mloubout it is strange, but sometimes @everywhere BLAS.set_num_threads(1) after using JUDI works and sometimes it does not.
Now I'm trying to use only 3 of 4 computational cores on each node:
addprocs(["user@10.128.0.33",
"user@10.128.0.23",
"user@10.128.0.20",
"user@10.128.0.22"],
env=["DEVITO_LANGUAGE"=>"openmp", "OMP_NUM_THREADS"=>"3", "DEVITO_LOGGING"=>"INFO"])
I hope this helps.
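One way to confirm that the `env` settings actually reach the workers is to read them back from each process. This is only a sketch: two local workers stand in for the SSH hosts above (an assumption made so the snippet runs on a single machine), and the values mirror the ones in the `addprocs` call.

```julia
using Distributed

# Local workers stand in for the SSH hosts; `env` is passed the same way.
addprocs(2; env=["DEVITO_LANGUAGE" => "openmp", "OMP_NUM_THREADS" => "3"])

# Read the variable back on each worker to check it was applied.
for p in workers()
    omp = remotecall_fetch(() -> get(ENV, "OMP_NUM_THREADS", "unset"), p)
    @info "worker $p: OMP_NUM_THREADS = $omp"
end
```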
Helped for now
Didn't help...
Julia community thoughts on this (for future reference): https://github.com/JuliaLang/julia/issues/52154
Thanks for raising it there and the update
Hi,
I do calculations on the cloud (master node and 4 computational nodes, standard SSH cluster manager, CentOS 7).
Since yesterday I have been getting a segmentation fault. Before that, for about a week, I had not encountered this error:
I used to have Julia LTS 1.6.7, but today, after this problem started annoying me, I updated Julia using the following commands:
and now the Julia version is 1.9.3, but the problem still appears. I can get this error at iteration 9 or 4, or probably at any other time.
I understand that the problem is unlikely to be related to JUDI itself, but maybe you have already seen it?