GEOS-ESM / SMT-Nebulae

Software Modernization Team sandbox
https://geos-esm.github.io/SMT-Nebulae/
Apache License 2.0
0 stars 1 forks source link

[GEOS] Define operational and HPC metrics #66

Open FlorianDeconinck opened 5 months ago

FlorianDeconinck commented 5 months ago

Previous benchmark have been done with the "Node-to-node" metric to answer the question "can we replace a CPU node with a GPU node".

As we gear toward operation, this metric is no longer enough, should also be backed with more scientifically relevant metrics (Gridpoint, SYPD, SDPD which seems to be the GMAO preferred metric etc.).

We should also start measuring ourselves against the SCU17/18 Milan nodes and their 128 cores.

Electric consumptions and price are also previous metric we should carry.

Another angle is scaling and operational usefulness of each hardware, so that the narrative to the scientists is clear.

This process should involve the GMAO but remain lead by us as to make sure we can deliver.

Overall, pragmatism is key: we are not here to give roofline projection and peak FLOPS, we are here to deliver day-to-day usage.


FlorianDeconinck commented 2 months ago

Has part of this work we should also do projection of requirements for running bigger simulations, now and every year upward.

Per Tsengdar"

Per Laura:

FlorianDeconinck commented 1 month ago

Working on it as part of the SC24 presentation.

Science