Open rortizmerino opened 2 months ago
For reading/writing, there are a few tests that we can do: on the one hand, to see how many cores/processes should be used for reading/writing for optimal performance, and on the other hand whether writing multiple time steps per file would increase or decrease writing time.
Testing for timestep writing optimization:
Testing for cores/processes optimization:
More tests might be added if needed.
Two main strategies are necessary, and there are different ways to implement them. Main points go as follow:
[ ] 1. End point: For example, using seff on DefltBlue
[ ] 2. Snapshot: Like memory_profiler, cProfile and psutil
[ ] 3. Interactive: Logging into the snellius node, visualising the dask cluster(s) peformance
Specific issues will be raised for each point. Their main goal is to effectively calculate resources for each environment.