buttonmen-dev / buttonmen

Buttonmen - an online dice game
Other
16 stars 24 forks source link

make replay testing more user-friendly #2953

Open cgolubi1 opened 6 months ago

cgolubi1 commented 6 months ago

The goal of replay testing is to throw compute time rather than human time at finding logic bugs / breaking changes in new code, even if the tester doesn't know what kind of bug they're looking for. That would be easier/more reliable if i sanded down some current rough edges in the replay test rig. Specific things i have in mind:

cgolubi1 commented 4 months ago

Another thought to integrate into the above list: it would be great to have less manual work (ideally none) needed to turn random_ai generated tests into responder tests.

cgolubi1 commented 1 week ago

A silly bug: when a game runs to 200 rounds (e.g. Echo vs IIconfused) and is cancelled, RandomAI falls over.

I think what's happening is simply that the python process gets OOM-killed when it's trying to pull the entire game action log into memory and write the final game state.