Open pimdh opened 1 month ago
Hi, our codebase changed significantly recently, and this has been causing small issues. For Equiformer, I've updated the checkpoints in this PR: https://github.com/FAIR-Chem/fairchem/pull/893/files
For Gemnet-OC, can you try using this scale file: configs/odac/s2ef/scaling_factors/gemnet-oc.pt
Thanks a ton, @anuroopsriram! This solves the issue for EquiformerV2.
Unfortunately, for Gemnet, that scale file doesn't work. I tried running on branch anuroopsriram-patch-1, commit 2788d9e9c4a36263.
Get model with:
wget "https://dl.fbaipublicfiles.com/dac/checkpoints_20231018/Gemnet-OC_Direct.pt" -O /tmp/fairchem_checkpoints/Gemnet-OC_Direct.pt
Then run:
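import torch
from fairchem.core.common.relaxation.ase_utils import OCPCalculator

checkpoints = "/tmp/fairchem_checkpoints/Gemnet-OC_Direct.pt"
# Point the checkpoint's model config at the suggested scale file, then overwrite the checkpoint.
data = torch.load(checkpoints)
data["config"]["model_attributes"]["scale_file"] = "/opt/fairchem/configs/odac/s2ef/scaling_factors/gemnet-oc.pt"
torch.save(data, checkpoints)
calc = OCPCalculator(checkpoint_path=checkpoints, cpu=False)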
This yields the error:
WARNING:root:Scale factor out_blocks.0.scale_rbf_F not found in model
WARNING:root:Scale factor out_blocks.1.scale_rbf_F not found in model
WARNING:root:Scale factor out_blocks.2.scale_rbf_F not found in model
WARNING:root:Scale factor out_blocks.3.scale_rbf_F not found in model
WARNING:root:Scale factor out_blocks.4.scale_rbf_F not found in model
...
ValueError: Scale factor parameter int_blocks.0.trip_interaction.scale_rbf.scale_factor is inconsistent with the loaded state dict.
Old: Parameter containing:
tensor(8.6465, device='cuda:0')
Actual: 0.0
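One way to narrow this down is to compare the names stored in the scale file against the scale-factor entries in the checkpoint's state dict. The sketch below is a minimal, hypothetical helper (`compare_scale_factors` is not part of fairchem). It assumes both inputs have already been reduced to plain name-to-float mappings, for example after calling `torch.load` on the two files, and that state-dict keys end in `.scale_factor` as in the error above. The sample values other than 8.6465 are made up for illustration.

```python
from typing import Dict, List, Tuple

SUFFIX = ".scale_factor"  # assumption: suffix seen in the ValueError above


def compare_scale_factors(
    scale_file: Dict[str, float],
    state_dict: Dict[str, float],
    tol: float = 1e-6,
) -> Tuple[List[str], List[str], List[str]]:
    """Return (only_in_file, only_in_checkpoint, mismatched) name lists."""
    # Keep only scale-factor entries and strip the suffix so names are comparable.
    ckpt = {
        key[: -len(SUFFIX)]: value
        for key, value in state_dict.items()
        if key.endswith(SUFFIX)
    }
    only_in_file = sorted(set(scale_file) - set(ckpt))
    only_in_ckpt = sorted(set(ckpt) - set(scale_file))
    mismatched = sorted(
        name
        for name in set(scale_file) & set(ckpt)
        if abs(scale_file[name] - ckpt[name]) > tol
    )
    return only_in_file, only_in_ckpt, mismatched


# Illustrative inputs only; real values would come from the two .pt files.
example_scale_file = {
    "out_blocks.0.scale_rbf_F": 1.0,
    "int_blocks.0.trip_interaction.scale_rbf": 0.0,
}
example_state_dict = {
    "int_blocks.0.trip_interaction.scale_rbf.scale_factor": 8.6465,
    "int_blocks.0.dense.weight": 0.5,
}
only_in_file, only_in_ckpt, mismatched = compare_scale_factors(
    example_scale_file, example_state_dict
)
print("only in scale file:", only_in_file)
print("only in checkpoint:", only_in_ckpt)
print("mismatched values:", mismatched)
```

Names that appear only in the scale file would correspond to the "not found in model" warnings, and mismatched values to the "inconsistent with the loaded state dict" error, if the assumptions above hold.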
This issue has been marked as stale because it has been open for 30 days with no activity.
Hi,
I have trouble loading the EquiformerV2-IS2RE-ODAC and Gemnet-OC-IS2RE-ODAC pretrained models. With fairchem-core==1.2.0, if I run
I get error
This is similar to what was noted in the closed issue #727. I can't find any version of EquiformerV2 that supports this kwarg.
When I run
I get error
This is similar to what was noted in closed issue #603. If I remove the scale file from the config in the checkpoint (similar to what was done in PR #625) and try to do predictions, I get errors like:
Thanks!
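The checkpoint edits discussed in this thread (pointing `scale_file` at a different path, or removing it from the config) can be sketched as a small helper that works on the loaded checkpoint dict without mutating it. This is only an illustration: `with_scale_file` is a hypothetical name, and the only thing taken from the thread is the `["config"]["model_attributes"]["scale_file"]` key path. In practice the dict would come from `torch.load` and the result would be written to a new path with `torch.save`, so the downloaded file stays untouched.

```python
import copy
from typing import Any, Dict, Optional


def with_scale_file(
    checkpoint: Dict[str, Any], scale_file: Optional[str]
) -> Dict[str, Any]:
    """Return a patched copy of the checkpoint dict.

    If scale_file is None, the entry is removed; otherwise it is set.
    Only the config is deep-copied, so model weights are not duplicated.
    """
    patched = dict(checkpoint)  # shallow copy of the top-level dict
    patched["config"] = copy.deepcopy(checkpoint["config"])
    attrs = patched["config"]["model_attributes"]
    if scale_file is None:
        attrs.pop("scale_file", None)
    else:
        attrs["scale_file"] = scale_file
    return patched


# Stand-in for a loaded checkpoint; the values are placeholders.
original = {
    "config": {"model_attributes": {"scale_file": "old.pt"}},
    "state_dict": {},
}
removed = with_scale_file(original, None)
replaced = with_scale_file(original, "configs/odac/s2ef/scaling_factors/gemnet-oc.pt")
print(removed["config"]["model_attributes"])
print(replaced["config"]["model_attributes"]["scale_file"])
```

Keeping the original dict unchanged makes it easier to try several scale-file settings against the same download.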