cschlaffner / PROTzilla2


Improve performance of "memory" mode for dataframes #493

Closed henninggaertner closed 1 month ago

henninggaertner commented 1 month ago

Issue description

Currently, in the "memory" dataframe mode, the dataframes are kept in RAM, which is intended, but they are also written to disk (and overwritten!) on every save of the run. As a result, hundreds of MB are written unnecessarily, causing a lot of disk I/O.

We intend to implement a check for whether a file really needs to be (over)written, as well as a way to clear all files that are no longer of use in the current run.
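
One way the two pieces described above could fit together is sketched below. This is only an illustration under assumptions, not PROTzilla's actual implementation: the class `ChangeAwareWriter` and its methods `write_if_changed` and `remove_stale_files` are hypothetical names, and the dataframe is assumed to be already serialized to bytes (for example the encoded CSV text of a dataframe). The sketch remembers a SHA-256 digest of the last content written per path and skips the write when the digest is unchanged.

```python
import hashlib
from pathlib import Path


class ChangeAwareWriter:
    """Hypothetical helper: skips writes whose content has not changed."""

    def __init__(self) -> None:
        # Maps each file path to the digest of the content last written to it.
        self._digests: dict[Path, str] = {}

    def write_if_changed(self, path: Path, payload: bytes) -> bool:
        """Write payload to path only if it differs from the last write.

        Returns True if the file was written, False if the write was skipped.
        """
        digest = hashlib.sha256(payload).hexdigest()
        # Skip only if the content is unchanged AND the file is still on disk.
        if self._digests.get(path) == digest and path.exists():
            return False
        path.parent.mkdir(parents=True, exist_ok=True)
        path.write_bytes(payload)
        self._digests[path] = digest
        return True

    def remove_stale_files(self, directory: Path, keep: set[Path]) -> list[Path]:
        """Delete files in directory that are not in keep; return the removed paths."""
        removed = []
        for file in sorted(directory.iterdir()):
            if file.is_file() and file not in keep:
                file.unlink()
                self._digests.pop(file, None)
                removed.append(file)
        return removed
```

Hashing still requires serializing the dataframe on every save, so this trades some CPU time for fewer disk writes. The digests live only in memory, so the first save after a restart would rewrite every file once.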

Acceptance Criteria