Implement a tournament between several agents. For now it is all-to-all: n(n-1) ordered pairings of agents, each repeated number-of-repetitions times. Allow multiple agents to be specified on the command line: agents are positional arguments, and all other settings are keyword arguments.
For example,
tournament --size 20 --repeat 1000 --samples 100 uct uniform random
means 20 switches, 100 samples per node in the Monte Carlo sampling algorithms, and each experiment repeated 1000 times, with uct, uniform and random as the competing agents.
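A minimal sketch of how the command line and the all-to-all schedule could look, assuming Python and argparse. The names build_parser and schedule and the default values are hypothetical and not taken from any existing code; the actual game-playing logic is left out.

```python
import argparse
from itertools import permutations


def build_parser():
    """Command-line parser: agents are positional, everything else is a keyword option."""
    parser = argparse.ArgumentParser(prog="tournament")
    # Default values below are placeholders chosen for illustration only.
    parser.add_argument("--size", type=int, default=20,
                        help="number of switches")
    parser.add_argument("--repeat", type=int, default=1000,
                        help="how many times each experiment is repeated")
    parser.add_argument("--samples", type=int, default=100,
                        help="samples per node in Monte Carlo sampling algorithms")
    parser.add_argument("agents", nargs="+",
                        help="names of the agents that take part")
    return parser


def schedule(agents, repeat):
    """Yield (first, second, repetition) for every ordered pair of distinct agents.

    With n agents this produces n(n-1)*repeat entries in total.
    """
    for first, second in permutations(agents, 2):
        for repetition in range(repeat):
            yield first, second, repetition


if __name__ == "__main__":
    args = build_parser().parse_args()
    games = list(schedule(args.agents, args.repeat))
    print(f"{len(args.agents)} agents, {len(games)} games, "
          f"size={args.size}, samples={args.samples}")
```

With three agents there are 3*2 = 6 ordered pairings, so the example command above would schedule 6*1000 = 6000 games.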