Open luiztauffer opened 2 months ago
This is a problem with numpy multithreading, rather than Spark. The BLAS/OpenMP thread-count variables are read only once, when numpy is first imported, so after that point one cannot change the number of requested threads; this leads to thread oversubscription during Spark parallelism.
It should work if we move these exact blocks to the top of the script. The key is to always set the environment variables before numpy is imported. We also need to watch out for other modules that import numpy indirectly. So, in general, this change has to be made carefully, and we need to test performance to check whether multithreading reappears during parallelism.
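For concreteness, a minimal sketch of the suggested restructuring, assuming the blocks set the usual BLAS/OpenMP thread-count variables (the exact variables voluseg sets are in the linked step files, so the names below are assumptions):

```python
# Entry-point script (sketch). The thread limits must be exported
# before numpy -- or anything that imports numpy -- is loaded, because
# the BLAS/OpenMP thread pools read these variables only once.
import os

os.environ["MKL_NUM_THREADS"] = "1"        # Intel MKL
os.environ["OMP_NUM_THREADS"] = "1"        # OpenMP-based BLAS builds
os.environ["OPENBLAS_NUM_THREADS"] = "1"   # OpenBLAS

import numpy as np  # safe: thread limits are already in place

# ... remaining imports and the Spark pipeline follow here ...
```

Note that any import above the `os.environ` lines that pulls in numpy transitively (scipy, pandas, h5py, etc.) would silently defeat this, since numpy is only initialized once per interpreter.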
We can avoid repeated imports throughout the scripts by moving them to the top of the script. There's one specific import that I'm concerned about though: the numpy import after setting local ENV vars:
https://github.com/mikarubi/voluseg/blob/c2cf159ce371b0cf62299515ebdb24ca788189f2/voluseg/_steps/step4a.py#L7-L12
This happens multiple times across the project. Is this necessary because of pyspark opening multiple threads, or something similar? Is there any way we could avoid that?
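For reference, the repeated block follows roughly this pattern (a sketch based on the linked lines; the exact variable names, and whether the block sits at module level or inside each step function, are assumptions):

```python
# Sketch of the pattern repeated across the step files.
def step(parameters):  # hypothetical stand-in for step4a and friends
    import os
    os.environ["MKL_NUM_THREADS"] = "1"
    os.environ["OMP_NUM_THREADS"] = "1"

    import numpy as np  # only limited if numpy was not imported earlier

    # ... step logic using numpy ...
```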