Ch 13 agent/alphago.py wrong code?

pocca2048 commented 3 years ago

Hi! I am reading your book and I think I found an error in Ch 13.

Why do you reset self.root here https://github.com/maxpumperla/deep_learning_and_the_game_of_go/blob/50aa0ce1cd1e638c62c5ddd6ee602db3b1c23d47/code/dlgo/agent/alphago.py#L118

before referencing it below? https://github.com/maxpumperla/deep_learning_and_the_game_of_go/blob/50aa0ce1cd1e638c62c5ddd6ee602db3b1c23d47/code/dlgo/agent/alphago.py#L119

I don't know what was the intention of it but I believe this would be wrong since that will never be executed.

pocca2048 commented 3 years ago

found out this https://github.com/maxpumperla/deep_learning_and_the_game_of_go/issues/42 so closing it

pocca2048 commented 3 years ago

This code is wrong either:

if move in self.root.children: 
    self.root = self.root.children[move]
    self.root.parent = None
else:
    self.root = AlphaGoNode()
return move

since else is never executed and self.root cannot reflect opponent's move so that it eventually makes illegal moves.

To solve this, there are two choices:

reset by self.root = AlphaGoNode() no matter what.

make a new function that when opponent makes a move, reflect it on self.root. e.g.,

def reflect_move(self, move):
if move in self.root.children: 
    self.root = self.root.children[move]
    self.root.parent = None
else:
    self.root = AlphaGoNode()

Please let me know if I'm wrong.

maxpumperla / deep_learning_and_the_game_of_go

Ch 13 agent/alphago.py wrong code? #94