tummfm / difftre

Learning neural network potentials from experimental data via Differentiable Trajectory Reweighting
Apache License 2.0

Reporting NAN when training CG_water model #7

Open XinjianOUYANG opened 4 months ago

XinjianOUYANG commented 4 months ago

Hi, I am currently trying to reproduce your results by running the CG_water.ipynb notebook, and it raised a NaN error during training, as follows:

Step 0 in 90.34 sec Loss = 0.20526731
Step 1 in 854.44 sec Loss = 0.19676176
Step 2 in 854.85 sec Loss = 0.18241243
Step 3 in 858.79 sec Loss = 0.12344666
Step 4 in 856.21 sec Loss = 0.09613695
Step 5 in 880.87 sec Loss = nan

Have you ever run into this issue, and could you give some advice on how to solve it? Thanks a lot!

S-Thaler commented 3 months ago

Hi, thanks for your interest in DiffTRe. I did not experience NaNs with the hyperparameters provided in the notebook. Usually, NaN values originate from simulations that sample unphysical regions, most often when two atoms overlap. You can check this by computing the pairwise distances for each frame of the trajectory that becomes NaN (see the sketch below). If atoms start to overlap, you could increase the strength of the prior potential to prevent this; that is the first thing I would try. Another standard remedy is to reduce the learning rate: I increased it as much as I could so training converges in the fewest updates and saves compute, so a smaller learning rate is usually a bit safer. The reweighting also introduces some numerical noise, so performing it in float64 is often a good measure.
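A minimal sketch of such an overlap check, assuming the sampled trajectory is available as an array of shape (n_frames, n_particles, 3) in a cubic periodic box of side `box_length` (both names are placeholders, not taken from the notebook):

```python
import jax.numpy as jnp
from jax import vmap, random

def min_pair_distance(frame, box_length):
    """Smallest pairwise distance in one frame, using the minimum-image convention."""
    dr = frame[:, None, :] - frame[None, :, :]       # (N, N, 3) displacement vectors
    dr -= box_length * jnp.round(dr / box_length)    # minimum image for a cubic box
    dist = jnp.sqrt(jnp.sum(dr**2, axis=-1))
    # mask self-distances on the diagonal before taking the minimum
    dist = jnp.where(jnp.eye(frame.shape[0], dtype=bool), jnp.inf, dist)
    return jnp.min(dist)

# Placeholder data: replace with the sampled CG water trajectory that goes NaN.
box_length = 3.0
trajectory = random.uniform(random.PRNGKey(0), (10, 100, 3)) * box_length

min_dists = vmap(min_pair_distance, in_axes=(0, None))(trajectory, box_length)
print(min_dists)  # values close to zero indicate overlapping atoms
```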
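For the float64 suggestion: JAX computes in float32 by default, and one way to switch to double precision globally is the x64 flag, which must be set before the first JAX operation:

```python
from jax import config
config.update('jax_enable_x64', True)  # enable float64 before any other JAX calls
```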