mrirecon / bart

BART: Toolbox for Computational Magnetic Resonance Imaging
https://mrirecon.github.io/bart/
BSD 3-Clause "New" or "Revised" License

Thread oversubscription on multicore systems #296

Open headmeister opened 1 year ago

headmeister commented 1 year ago

Hello, I encountered this error on our machine (BART v0.8, compiled from source) when running ecalib through the Python wrapper. The machine has 128 cores, and the call failed with the following message:

BLAS : Program is Terminated. Because you tried to allocate too many memory regions. Segmentation Fault.

This problem is most likely related to the OpenBLAS issue discussed at https://github.com/awslabs/autogluon/issues/1020. When I limited the number of threads for OpenBLAS and OpenMP to 32 via environment variables, the error went away. This might affect others too when BART is run on larger machines. I know this is probably not the final solution, but it can at least help in getting BART to run.
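For anyone who wants to try the same workaround from Python, here is a minimal sketch. It assumes the limits are set through the standard `OPENBLAS_NUM_THREADS` and `OMP_NUM_THREADS` environment variables, and that they are set before the BART call so the process that runs BART inherits them. The helper name `limit_blas_threads` is made up for illustration, and the commented-out wrapper call at the bottom is an assumption about a typical setup, not something verified on the affected machine.

```python
import os


def limit_blas_threads(max_threads=32, env=None):
    """Cap OpenBLAS and OpenMP thread counts in an environment mapping.

    `env` defaults to os.environ, so child processes started afterwards
    inherit the limits. Passing a plain dict leaves os.environ untouched.
    """
    if max_threads < 1:
        raise ValueError("max_threads must be at least 1")
    if env is None:
        env = os.environ
    value = str(max_threads)
    env["OPENBLAS_NUM_THREADS"] = value  # read by OpenBLAS at start-up
    env["OMP_NUM_THREADS"] = value       # read by the OpenMP runtime
    return env


if __name__ == "__main__":
    # Apply the limit before anything BLAS-related is started.
    limit_blas_threads(32)

    # Assumed usage with the BART Python wrapper (not run here):
    # from bart import bart
    # sens = bart(1, 'ecalib', kspace)  # kspace: your k-space array
```

The value 32 is simply what worked on the 128-core machine; other systems may need a different limit.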

Best Regards, Jiri

uecker commented 1 year ago

Thanks for pointing this out!