Closed xnuohz closed 1 year ago
Hi @xnuohz
You can use smaller L_{max} (e.g., "128x0e+128x1e+128x2e" -> "128x0e+128x1e"), smaller numbers of channels, or smaller numbers of blocks.
Besides, using a smaller number of maximum neighbors might be helpful.
Thanks @yilunliao As you mentioned, which one do you think will influence the model's metrics most ? I typically observe runtime/model size vs. accuracy trade-off.
I am not sure. This can depend on applications or datasets. I guess reducing L_{max} from 2 to 1 would hurt the most in most of cases.
Hi, thanks for sharing the code, I'd like to try it on my own dataset.
But, unlike MD17 in which the molecules have only 12 atoms, my dataset has more atoms and it'll allocate more GPU memories. May you give me some advice to reduce the model or input size?
train and evaluate batch size are set as 8, only when the process of calculating the force is removed during the test, it will not show OOM
Thank you.