barakugav opened 2 years ago
What do you think about the noisy-beginning idea? I prefer not to use external knowledge about the game (e.g. opening book)
The noisy beginning will not necessarily result in an equalized position for the rest of the game, I'm not sure. Why not an opening book? Only to choose the initial position
I think it could be okay that the position is not equalized, because (a) the moves are still chosen by the players (just with some added 'luck'), and (b) that's why we do many comparisons - there is some 'luck' involved. I think it's much more elegant if the whole training flow has no human knowledge, or at least no more human knowledge than in AlphaZero. Or at least, we should have such a workflow as the default (and possibly other ones too)
BTW you probably saw it, but I like this formulation in their paper
Alright, I agree it's more elegant without an opening book, but I still think we should look for a better solution. We will not run hundreds of comparison games... First of all, we can ensure both players get the same 'luck' by running twice from the same noisy position with the players switched. But I still think it will cause us to misevaluate the models
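The paired-game idea could look something like this (a minimal sketch; all function names and signatures here are made up for illustration, not from our codebase):

```python
import random

def noisy_opening(start, legal_moves, apply_move, plies, rng):
    """Reach a 'noisy' start position by playing a few uniformly random plies."""
    state = start
    for _ in range(plies):
        state = apply_move(state, rng.choice(legal_moves(state)))
    return state

def paired_match(play_game, opening, n_pairs, rng_seed=0):
    """Evaluate player A vs player B on the same noisy openings with colors
    swapped, so both sides get the same opening 'luck'.
    `play_game(state, a_is_white)` returns +1 if A wins, -1 if B wins, 0 on draw."""
    rng = random.Random(rng_seed)
    score_a = 0
    for _ in range(n_pairs):
        state = opening(rng)            # same position for both games of the pair
        score_a += play_game(state, True)   # A plays white
        score_a += play_game(state, False)  # A plays black
    return score_a
```

With this pairing, any advantage baked into a particular noisy opening cancels out within the pair, e.g. in a toy game where white always wins, `paired_match` scores the match as exactly 0.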
Do you think we can rely here on the noise from floating-point errors in the network activations? And multithreading
FP no, multithreading yes, but we don't have multithreading in a single search, we have multithreading of multiple searches, so currently it doesn't have any effect
In the paper it says "t -> 0", so maybe they just use a very small temperature, I think that's reasonable
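A small sketch of what "t -> 0" means in practice: the paper samples moves in proportion to N(a)^(1/t) over the MCTS visit counts, so a tiny temperature is effectively argmax (the helper name below is made up):

```python
import random

def sample_move(visit_counts, temperature, rng):
    """Sample a move index with probability proportional to N(a)^(1/t).
    As t -> 0 this degenerates to argmax (deterministic best move);
    t = 1 samples proportionally to the raw visit counts."""
    if temperature <= 1e-6:
        # limit case: pick the most-visited move deterministically
        return max(range(len(visit_counts)), key=lambda i: visit_counts[i])
    weights = [n ** (1.0 / temperature) for n in visit_counts]
    r = rng.random() * sum(weights)
    acc = 0.0
    for i, w in enumerate(weights):
        acc += w
        if r < acc:
            return i
    return len(weights) - 1  # guard against floating-point round-off
```

So a very small but nonzero t keeps a sliver of randomness between games without needing any external opening data.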
Maybe sample random openings from an opening book, which are still considered equal