DARMA-tasking / LB-analysis-framework

Analysis framework for exploring, testing, and comparing load balancing strategies

Add post LB statistics #356

Closed: ppebay closed this issue 1 year ago

ppebay commented 1 year ago

In particular:

ppebay commented 1 year ago

Adding the number of shared blocks as an allowed rank QOI, with the toy memory problem:

[Two screenshots: Screen Shot 2023-03-19 at 9:51:52 PM, Screen Shot 2023-03-19 at 9:51:58 PM]
ppebay commented 1 year ago

And with the "few" challenging problem:

[Two screenshots: Screen Shot 2023-03-19 at 9:54:39 PM, Screen Shot 2023-03-19 at 9:55:32 PM]
ppebay commented 1 year ago

Also added the number of "uprooted" shared blocks QOI (i.e. those that are present on a rank but were not initially homed there):

[Two screenshots: Screen Shot 2023-03-19 at 10:07:39 PM, Screen Shot 2023-03-19 at 10:07:48 PM]
ppebay commented 1 year ago

And in contrast, the number of "homed" blocks:

[Two screenshots: Screen Shot 2023-03-19 at 10:11:50 PM, Screen Shot 2023-03-19 at 10:12:34 PM]
ppebay commented 1 year ago

We can also look at the above in terms of ratio (or fraction) of homed blocks relative to the total number of blocks:

[Two screenshots: Screen Shot 2023-03-19 at 10:20:51 PM, Screen Shot 2023-03-19 at 10:20:58 PM]
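
For reference, the four block QOIs discussed so far (shared, homed, uprooted, and homed fraction) can be sketched as below. This is only an illustration under stated assumptions, not the framework's actual code: the `Block` record, the `rank_block_qois` function, and the convention that a rank holding no blocks gets a homed fraction of 1.0 are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Block:
    """Hypothetical shared-block record: its id and the rank it was initially homed on."""
    block_id: int
    home_rank: int


def rank_block_qois(rank_id, blocks):
    """Compute block QOIs for one rank from the blocks currently present on it."""
    total = len(blocks)
    # Homed: present on this rank and initially homed here
    homed = sum(1 for b in blocks if b.home_rank == rank_id)
    # Uprooted: present on this rank but initially homed elsewhere
    uprooted = total - homed
    # Assumed convention: an empty rank counts as fully homed
    fraction = homed / total if total else 1.0
    return {
        "shared_blocks": total,
        "homed_blocks": homed,
        "uprooted_blocks": uprooted,
        "homed_fraction": fraction,
    }


# Rank 0 holds two of its own blocks plus one that was homed on rank 1
qois = rank_block_qois(0, [Block(0, 0), Block(1, 0), Block(2, 1)])
```

By construction the homed and uprooted counts always add up to the number of shared blocks on the rank, which is why the ratio view carries the same information in normalized form.
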
ppebay commented 1 year ago

The same as above for the "toy" problem; unsurprisingly, original homing was retained better than for the more challenging problem:

[Two screenshots: Screen Shot 2023-03-19 at 10:24:00 PM, Screen Shot 2023-03-19 at 10:24:12 PM]
ppebay commented 1 year ago

So @lifflander, is this the kind of QOI that would fit the bill and provide the post-LB information that we would then want to summarize with a global statistic?
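
To make "summarize with a global statistic" concrete, here is a minimal sketch of reducing a per-rank QOI to a few global numbers. The function name `summarize_qoi` and the choice of statistics (minimum, maximum, mean, population standard deviation) are assumptions for illustration; which statistics are actually wanted is exactly the open question.

```python
import statistics


def summarize_qoi(values):
    """Reduce one per-rank QOI (one value per rank) to global summary statistics."""
    values = list(values)
    if not values:
        raise ValueError("need at least one rank value")
    return {
        "min": min(values),
        "max": max(values),
        "mean": statistics.fmean(values),
        # Population standard deviation: the ranks are the whole population, not a sample
        "std": statistics.pstdev(values),
    }


# Hypothetical homed fractions on four ranks after load balancing
summary = summarize_qoi([1.0, 0.5, 0.0, 0.5])
```

A minimum of zero for the homed fraction would immediately flag that at least one rank retained none of its original blocks.
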

nlslatt commented 1 year ago

@PPP I like these QOIs.

For the challenging example visualized above, have you printed the details like in the very first comment at the top of the page to confirm that almost nothing is still at home at iteration 8/9? That seems pretty extreme.

If the visualization is correct, it seems like there is room for improvement in the load balancer. I would expect better performance from keeping the new groupings of objects that the load balancer found, but changing which rank each entire grouping is processed on, so that more ranks might have at least one object at home.
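
A rough sketch of that relabeling idea, assuming ranks are interchangeable so that moving a whole grouping to another rank leaves the load distribution unchanged. The function `best_relabeling` and the `home_counts` input are hypothetical, and the brute-force search over permutations is only practical for a handful of ranks; a real implementation would need an assignment solver instead.

```python
from itertools import permutations


def best_relabeling(home_counts):
    """Pick which rank processes each grouping, keeping the groupings themselves intact.

    home_counts[g][r] is the number of objects in grouping g whose original rank is r.
    Returns (perm, (ranks_with_home_object, total_objects_at_home)), where perm[g] is
    the rank assigned to grouping g.
    """
    n = len(home_counts)
    best_key, best_perm = None, None
    for perm in permutations(range(n)):
        at_home = [home_counts[g][perm[g]] for g in range(n)]
        # Primary goal: as many ranks as possible with at least one object at home;
        # ties are broken by the total number of objects at home.
        key = (sum(1 for c in at_home if c > 0), sum(at_home))
        if best_key is None or key > best_key:
            best_key, best_perm = key, perm
    return best_perm, best_key


# Three groupings that currently sit on ranks where none of their objects are at home
home_counts = [
    [0, 3, 0],  # grouping 0: all 3 objects originally from rank 1
    [0, 0, 2],  # grouping 1: both objects originally from rank 2
    [4, 0, 0],  # grouping 2: all 4 objects originally from rank 0
]
perm, key = best_relabeling(home_counts)
```

In this toy case the identity placement leaves nothing at home, while the relabeling puts every grouping on the rank its objects came from without touching the groupings.
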