kblomdahl / dream-go

Artificial go player based on reinforcement and supervised learning
Apache License 2.0
47 stars 8 forks source link

About MCTSnet #41

Closed tensorstart closed 5 years ago

tensorstart commented 5 years ago

Hello, have you successfully replaced MCTS with MCTSnet?I have applied MCTSnet in the Sokoban, which is a GYM based environment, but the effect is not very good. The accuracy rate of each step is only 70%, and the victory rate is only 3%. Could you tell me how your effect is,thx.

kblomdahl commented 5 years ago

Hi, I have not had much success with MCTSnet. I have a half implementation [1] but I have encountered numerous problems that made me pause the implementation for the moment:

I have not taken the project further than the training step, so I do not know how it would affect the actual playing strength.

[1] https://github.com/Chicoryn/dream-go/tree/dev/mctsnet/contrib/mctsnet

tensorstart commented 5 years ago

Thank you for your reply, we have encountered similar problems with you. Our GPU utilization is also not high. Although Deepmind's ideas are interesting, they don't give any reference code. We were surprised to see that they used MCTSnet to perform a simulation training 3 million times in a Sokoban environment to achieve a 60% success rate.