Performance evaluation during training

werner-duvaud / muzero-general

MuZero

MIT License

2.49k stars 611 forks source link

Hi,

I have created a poker envirnoment and I want to train an agent with this implementation. In these occasions, the training proccess can take really long and also the training via self-play does not show any clear signs about the agent's performance. For these reasons, I was considering testing the agent at regular intervals against a random or a different trained agent and show the results in tensorboard for better monitoring. Because in your implementation the training process is continuous, is there way to apply this kind of evaluation?

Thank you in advance.

werner-duvaud / muzero-general

Performance evaluation during training #94