Closed cvsik closed 2 months ago
@cvsik, how large is your molecular system? Please also check if another GPU process is consuming the GPU memory.
Hi, it's the test system in the repo, mol_single.xyz
, with 22 atoms. There are no other processes running on the GPU, just the aimnet2 one.
I'm using the following pkg versions if that's of any help:
torch 2.3.0+cu118 pypi_0 pypi
torch-cluster 1.6.3+pt23cu118 pypi_0 pypi
torchaudio 2.3.0+cu118 pypi_0 pypi
torchvision 0.18.0+cu118 pypi_0 pypi
@cvsik : please check if it is fixed now.
@cvsik : please check if it is fixed now.
You mean with this commit? No, that's a change I also did locally to be able to run at all...
Great, that seems to have fixed it. Thanks a lot for the quick response & fit! Much appreciated! 🚀
Hi, thanks for the great contribution!
While trying to run the minimal example mol_single_opt.yml, I ran into the following memory issue while aimnet tries to build the nb list apparently:
I wouldn't have expected to run out of memory that quickly on an A100 40 GB device 😁 The earlier versions (I guess without these nb lists?) ran fine, also on smaller devices.
Do you have any advice on what to do to fix this? Run the nb list creation on CPU only?
Many thanks in advance!
Best, Max