The handler finalise methods involve initiating pandas dataframe classes (and also geopandas) from some big numpy arrays. At this step there is therefore i) a duplication of information in the arrays and tables and ii) the problem of potentially inefficient info storage in pandas.
This is sometime causing late process memory failures, eg:
11-30 12:19 elara.event_handlers INFO Finalising <class 'elara.event_handlers.VolumeCounts'>
Killed
Likelly three actions: i) delete obsolete data immediately, ii) force some pandas data types to reduce memory and iii) consider removing pandas all togther (although it is super convenient)
The handler finalise methods involve initiating pandas dataframe classes (and also geopandas) from some big numpy arrays. At this step there is therefore i) a duplication of information in the arrays and tables and ii) the problem of potentially inefficient info storage in pandas.
This is sometime causing late process memory failures, eg:
Likelly three actions: i) delete obsolete data immediately, ii) force some pandas data types to reduce memory and iii) consider removing pandas all togther (although it is super convenient)