andyljones / boardlaw

Scaling scaling laws with board games.
https://andyljones.com/boardlaw
MIT License
38 stars 7 forks source link

Faster league #6

Closed andyljones closed 3 years ago

andyljones commented 3 years ago

(This hinges on whether a league is even necessary)

80% of moves in each iteration are carried out by the latest version of the agent, and 20% of moves are done by a mixture of old agents. This is based on OA5's league, and it's intended to suppress cyclic behaviours. Right now though playing the challengers is really slooow because each challenger requires its own invocation of the net, and because they each only play on a handful of envs the dispatch overhead exceeds the GPU runtime.

Possible fixes:

Either way, I realise I need a test suite for the league before I go any further with any of these. What's a probe env for leagues look like?

andyljones commented 3 years ago

No longer think the league is necessary on any of the board sizes I care about. RIP all of this.