Input pipeline fully out of core

apax-hub / apax

A flexible and performant framework for training machine learning potentials.

MIT License

12 stars 1 forks source link

Input pipeline fully out of core #251

Open M-R-Schaefer opened 3 months ago

M-R-Schaefer commented 3 months ago

While the recent input pipeline rework has already greatly reduced memory consumption, we can defuinitely decrease it further. The current major RAM hog is the dict of ASE atoms, not the data pipeline. We can avoid this by reading the file on the fly in the generator instead of all at once, thereby avoiding the need to keep all atoms in memory.