ai2cm / pace

Implementation of the FV3GFS / SHiELD atmospheric model in Python
https://ai2cm.github.io/pace/
Apache License 2.0

`gt:gpu` slowdown on `gt4py` v1 due to change in allocation #412

Open FlorianDeconinck opened 1 year ago

FlorianDeconinck commented 1 year ago

gt4py v1 removes the `Storage` class and allows any object that describes itself through `__array_interface__` to be bound. Unfortunately, the default cupy allocation used in our model has a bad stride (it should have unit stride), which degrades performance in the backend.
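To make the stride point concrete, here is a minimal pure-Python sketch of how the memory layout decides which axis ends up with unit stride. The shape, the `(I, J, K)` axis naming, the helper names `strides_in_bytes` and `unit_stride_axis`, and the choice of axis 0 as the one a backend might want contiguous are all assumptions for illustration; none of them are taken from pace or gt4py.

```python
from typing import Sequence, Tuple


def strides_in_bytes(
    shape: Sequence[int], itemsize: int, order: Sequence[int]
) -> Tuple[int, ...]:
    """Byte strides for `shape` when the axes are laid out in `order`,
    listed from the slowest-varying axis to the fastest-varying one."""
    strides = [0] * len(shape)
    step = itemsize
    # Walk from the fastest-varying axis outwards, growing the step each time.
    for axis in reversed(order):
        strides[axis] = step
        step *= shape[axis]
    return tuple(strides)


def unit_stride_axis(strides: Sequence[int], itemsize: int) -> int:
    """Index of the axis whose neighbouring elements are adjacent in memory."""
    return list(strides).index(itemsize)


# Hypothetical (I, J, K) field of float64 values; sizes are made up.
shape = (12, 12, 79)
itemsize = 8

# Default C-order allocation: the last axis is contiguous.
default_strides = strides_in_bytes(shape, itemsize, order=(0, 1, 2))

# Assumed alternative layout in which the first axis is contiguous instead.
alt_strides = strides_in_bytes(shape, itemsize, order=(2, 1, 0))

print(default_strides, unit_stride_axis(default_strides, itemsize))
print(alt_strides, unit_stride_axis(alt_strides, itemsize))
```

On a real array the same information is exposed by the `strides` attribute of a numpy or cupy `ndarray`, so comparing it against the layout the backend expects is a quick way to confirm whether an allocation is affected.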

Potential solution: