Open carlosfmora opened 7 months ago
I think WDL is best viewed as a stacked graph. E.g. like here https://lczero.org/dev/stats/2022/ but horizontal rather than vertical.
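A minimal sketch of the data a stacked WDL graph would need, assuming the engine reports WDL as three integers per move (commonly per mille, summing to 1000). The function name `stacked_bands` and the sample values are hypothetical and only illustrate the layout: the win band runs from 0 to `win_top`, the draw band from `win_top` to `draw_top`, and the loss band fills the rest.

```python
from typing import List, Tuple


def stacked_bands(wdl_per_move: List[Tuple[int, int, int]]) -> List[Tuple[float, float]]:
    """Return (win_top, draw_top) per move as fractions of the full band height.

    Win band:  0        .. win_top
    Draw band: win_top  .. draw_top
    Loss band: draw_top .. 1
    """
    bands = []
    for w, d, l in wdl_per_move:
        total = w + d + l
        if total <= 0:
            raise ValueError("WDL triple must have a positive sum")
        # Normalise by the sum so the three bands always fill the whole height.
        bands.append((w / total, (w + d) / total))
    return bands


# Hypothetical per-mille WDL values for three consecutive moves (not real game data).
game = [(300, 600, 100), (450, 450, 100), (800, 150, 50)]
for move_no, (win_top, draw_top) in enumerate(stacked_bands(game), start=1):
    print(move_no, round(win_top, 3), round(draw_top, 3))
```

With the move number on one axis and these two boundaries on the other, any charting library's stacked area plot can draw the three bands.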
The open question is how to show two engines on the same axes.
Something like this, only nicer
Usually, the evaluation graph shows only one value: centipawns or a win percentage. What if there were an option to graph LC0's WDL (win/draw/loss) instead? That would give the analyst much more information.
For example, take a look at the WDL graph of the game between Gukesh and Firouzja from round 13 of the 2024 Candidates:
Or the most incredible game of the Candidates, the do-or-die game between Nepo and Caruana in round 14:
Or take a crazy position: how should we interpret a traditional evaluation of nearly 50% there, when WDL shows that a draw is highly unlikely? WDL in the final position:
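To make the point about a near-50% evaluation concrete, here is a small sketch with made-up numbers (they are not the values from the position above). It assumes the usual convention that the expected score is wins plus half the draws, so two very different WDL triples can collapse to the same single number. The function names and both sample triples are hypothetical.

```python
def expected_score(w: int, d: int, l: int) -> float:
    """Expected score in [0, 1]: a win counts 1, a draw 0.5, a loss 0."""
    total = w + d + l
    return (w + 0.5 * d) / total


def draw_rate(w: int, d: int, l: int) -> float:
    """Fraction of outcomes that are draws."""
    return d / (w + d + l)


# Hypothetical per-mille WDL triples, chosen only for illustration.
sharp = (480, 40, 480)      # either side may win, a draw is unlikely
dead_draw = (20, 960, 20)   # almost certainly drawn

for name, wdl in (("sharp", sharp), ("dead_draw", dead_draw)):
    print(name, expected_score(*wdl), draw_rate(*wdl))
```

Both triples give the same expected score, so a one-value graph cannot tell them apart, while the draw component separates them immediately.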
My recommendation: two graphs