Status: Closed (Waiting-TT closed 2 years ago)
Yes. Because PSRO with the Alpha-rank algorithm as its meta-solver can handle games with more than two players, model-based PSRO can also solve multi-player games.
When I run run_mb_psro on Leduc with 3 players, it fails with an error in which player_id is -4!
player_id = -4 is used to mark the end state. Please refer to the network/env_model.py file.
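To illustrate why an end-state marker of -4 can surface as an error, here is a minimal sketch of one plausible failure mode: using player_id directly as an index into a list of agents. The thread does not show the actual traceback, so this is an assumption, and `select_agent`, `agents_3p`, and `agents_4p` are hypothetical names for illustration, not code from run_mb_psro.py or network/env_model.py.

```python
# Value reported in this thread as the end-state marker.
TERMINAL_PLAYER_ID = -4


def select_agent(agents, player_id):
    """Hypothetical helper: return the acting agent, or None at the end state."""
    if player_id == TERMINAL_PLAYER_ID:
        return None  # the episode is over, so nobody acts
    if not 0 <= player_id < len(agents):
        raise ValueError(
            f"unexpected player_id {player_id} for {len(agents)} agents"
        )
    return agents[player_id]


agents_3p = ["agent0", "agent1", "agent2"]
agents_4p = agents_3p + ["agent3"]

# Unguarded indexing with -4: out of range for 3 agents,
# but silently wraps around to the first agent when there are 4.
try:
    agents_3p[TERMINAL_PLAYER_ID]
    unguarded_3p = "no error"
except IndexError:
    unguarded_3p = "IndexError"
unguarded_4p = agents_4p[TERMINAL_PLAYER_ID]

print(unguarded_3p, unguarded_4p, select_agent(agents_3p, TERMINAL_PLAYER_ID))
```

If the failure is of this kind, checking for the end-state marker before indexing (or stopping the loop as soon as it appears) avoids both the crash and the silent wrap-around.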
I0726 21:31:12.822597 139731719901376 rl_environment.py:190] Using game instance: leduc_poker
Game : leduc_poker
Seed: 1
Using 1000 sims per entry.
Rectifier :
Perturbating oracle outputs : False
Sampling from marginals : True
Using <function alpharank_strategy at 0x7f13fafb3510> as strategy method.
Using <function filter_function_factory.
Process finished with exit code 1
Do you know the player_id and the size of agents when the error occurs?
With PSRO on Leduc with 3 players, player_id becomes -4. It seems that run_mb_psro.py doesn't work for Leduc with 3 players.
It may be caused by the additional while-loop condition. See the red frame. Just delete that condition and it should work.
The change has been committed. Sorry for the inconvenience.
Can mb_psro_rl_oracle handle poker with 3, 4, 5, or more players?