invenia / Parallelism.jl

A library for threaded and distributed parallelism.
MIT License
8 stars 1 forks source link

Worker Freezing After Error #6

Open samuel-massinon opened 4 years ago

samuel-massinon commented 4 years ago

On Invenia Slack

7 out of 30 workers errored inside robust_pmap. The errors did not meet the condition for retry so any one of those workers should've brought all the workers down. But for some reason there were still 30 workers being reported by nworkers() but CPU was running just above 2300% (making me think only 23 workers were doing anything)