metaspace2020 / metaspace

Cloud engine and platform for metabolite annotation for imaging mass spectrometry
https://metaspace2020.eu/
Apache License 2.0
44 stars 9 forks source link

LithopsStalledException after MemoryError Exception and restarting lihtops executors #1470

Closed sergii-mamedov closed 5 months ago

sergii-mamedov commented 5 months ago

If at least one lithops executor has an Out Of Memory exception, we restart the entire step again with twice as much RAM. When migrating to AWS, we noticed a problem with monitoring restarted lithops executors. The technical description of the problem is also considered in this issue.

sergii-mamedov commented 5 months ago

This bug was fixed in Lithops 3.1.1. All changes from our side are described in this task: #1482