ecmwf-ifs / ectrans

Global spherical harmonics transforms library underpinning the IFS
Apache License 2.0
17 stars 33 forks source link

Add logic to avoid reallocating ZCOMBUF[RS] at each call #90

Closed samhatfield closed 5 months ago

samhatfield commented 5 months ago

This is a significant optimisation of the CPU code path. Credit owed to @marsdeno.

TCO1279, 48-node benchmark (--norms --truncation 1279 --niter 100 --nlev 137 --nfld 1 --vordiv --uvders --scders -v):

Almost 2x speed-up of the median transform time with identical norms.

marsdeno commented 5 months ago

Looks good to me