andyljones / boardlaw

Scaling scaling laws with board games.
https://andyljones.com/boardlaw
MIT License
38 stars 7 forks source link

Hyperparameter tuning #5

Closed andyljones closed 3 years ago

andyljones commented 3 years ago

All the hyperparameters right now were arrived at through 'grad student descent', which is to say they're garbage. Top of the list to tune:

This should all wait until I've got a general algorithm I'm happy with though.

andyljones commented 3 years ago

I did a lot of tuning ad-hoc, and the current settings have been ossified by the fact I've done a bunch of experiments with them. Might want to vary them and do re-reuns in a future, better-resourced version of this project. Might also need to do some light variations to check we're approximately at optimal - the fact we hit quickly perfect play on 9x9 suggests we are, but reviewers will likely disagree.