For large-scale simulations, we need MALA to be highly parallel. More precisely, we need to address the following:
[x] Parallel descriptor calculation
[x] Parallel network passes
[x] Parallel QE interface
[ ] Clever resource handling. What I am thinking of is an automated workflow that can handle an inhomogeneous number of ranks, i.e. many CPUs for the CPU-heavy tasks (SNAP descriptors, QE) but fewer GPUs for the NN passes; in the best case with some kind of load manager for e.g. Monte Carlo simulations
The first two have been addressed already. For the rest, I envision something like the following:
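A rough sketch of what such inhomogeneous resource handling could look like (the function and role names below are hypothetical illustrations, not part of MALA): partition the MPI ranks by workload type, so that a small number of GPU-backed ranks do the NN passes while the bulk of the CPU ranks share the descriptor/QE work.

```python
def assign_roles(world_size, n_gpu_ranks):
    """Partition MPI ranks by workload type (hypothetical sketch).

    The few GPU-backed ranks handle the network passes; the remaining
    CPU ranks share the CPU-heavy work (SNAP descriptors, QE).
    """
    if n_gpu_ranks > world_size:
        raise ValueError("more GPU ranks requested than ranks available")
    return {
        rank: "nn_gpu" if rank < n_gpu_ranks else "cpu_heavy"
        for rank in range(world_size)
    }

# With mpi4py, each role would then map onto its own sub-communicator, e.g.
#   roles = assign_roles(comm.Get_size(), n_gpu_ranks)
#   sub = comm.Split(color=0 if roles[comm.Get_rank()] == "nn_gpu" else 1,
#                    key=comm.Get_rank())
```

A load manager (e.g. for Monte Carlo simulations) would sit on top of this, feeding work items to whichever sub-communicator is idle.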
I think the current version of parallelization (except the GPU stuff, which we are working on) works well enough for now. Closing this, since we want to shift inference to GPU anyhow.