batch_size
We want to experiment with varying batch_sizes (1, 10, 20) on the two setups below.
Minimax
This will be a half-half split between the agent and minimax. Use AverageJoe with the small architecture, 10k episodes per generation, and 30 generations.
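A rough sketch of how this sweep could be written down, so the three runs only differ in batch_size. Everything named here (ExperimentConfig, build_sweep, opponent_schedule) is made up for illustration and is not from the existing code. It also assumes one particular reading of the half-half split, namely that episodes alternate between facing minimax and facing the agent itself; if the split means something else, only opponent_schedule needs to change.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ExperimentConfig:
    """Hypothetical container for one run of the batch_size experiment."""
    batch_size: int
    agent_name: str = "AverageJoe"
    architecture: str = "small"
    episodes_per_generation: int = 10_000
    generations: int = 30

    @property
    def total_episodes(self) -> int:
        # 10k episodes per generation * 30 generations = 300k episodes
        return self.episodes_per_generation * self.generations


def build_sweep(batch_sizes=(1, 10, 20)):
    """One config per batch size; all other settings stay fixed."""
    return [ExperimentConfig(batch_size=b) for b in batch_sizes]


def opponent_schedule(n_episodes):
    """Assumed reading of the half-half split: alternate opponents,
    so even-numbered episodes face minimax and odd-numbered ones
    face the agent."""
    return ["minimax" if i % 2 == 0 else "agent" for i in range(n_episodes)]
```

Looping over build_sweep() and passing each config to whatever training entry point we end up with keeps the comparison fair, since batch_size is the only thing that varies.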
Double agent
Save the agent each time we update it, then load the saved copy as its own opponent. Benchmark against minimax or some saved model (AverageJoe, for instance). I might be able to get this to work on my PC.
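The save-then-load loop could look roughly like the sketch below. ToyAgent, train_double_agent and the JSON checkpoint format are all placeholders, not the real agent or its serialization; the update step is a dummy, and actually playing the games against the opponent is left out. The point is only the order of operations: update, save, reload the saved copy as the opponent.

```python
import json
import os
import tempfile
from pathlib import Path


class ToyAgent:
    """Stand-in for the real agent; weights are just a list of floats."""

    def __init__(self, weights=None, version=0):
        self.weights = list(weights) if weights is not None else [0.0, 0.0]
        self.version = version

    def update(self, step):
        # Dummy update; the real one would train on played episodes.
        self.weights = [w + step for w in self.weights]
        self.version += 1

    def save(self, path):
        state = {"weights": self.weights, "version": self.version}
        Path(path).write_text(json.dumps(state))

    @classmethod
    def load(cls, path):
        state = json.loads(Path(path).read_text())
        return cls(weights=state["weights"], version=state["version"])


def train_double_agent(n_updates, checkpoint_path):
    """Update the agent, save it, and reload the saved copy as opponent."""
    agent = ToyAgent()
    agent.save(checkpoint_path)
    opponent = ToyAgent.load(checkpoint_path)
    for _ in range(n_updates):
        # (Episodes of agent vs. opponent would be played here.)
        agent.update(0.1)
        agent.save(checkpoint_path)
        # The opponent is a separate object loaded from disk, so later
        # updates to `agent` do not change it until the next reload.
        opponent = ToyAgent.load(checkpoint_path)
    return agent, opponent


def demo():
    """Run three updates in a temporary directory."""
    with tempfile.TemporaryDirectory() as tmp:
        agent, opponent = train_double_agent(3, os.path.join(tmp, "agent.json"))
        return agent.version, opponent.version
```

Loading from disk instead of reusing the same object means the opponent is a frozen snapshot, and the same checkpoint files can later serve as the saved models to benchmark against.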