Open leanderloew opened 2 weeks ago
Thanks for the report!
Since this artifact only shows up in the latitudinal direction, my guess is that it is somehow related to errors in the spherical harmonic transform. Potentially L4 vs T4 GPU have different TensorCores, with slightly different numerical precision?
Can you try setting the precision for all spherical harmonic transforms to full float32? https://github.com/google-research/neuralgcm/issues/56#issuecomment-2091121455
L4 GPU:
T4 GPU:
Other variables look fine. I ran this for the neural_gcm_dynamic_forcing_stochastic_1_4_deg mode.