junxiaosong / AlphaZero_Gomoku

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
MIT License
3.25k stars 965 forks

About self.data_buffer.extend(play_data) #39

Closed (huyp182 closed 6 years ago)

huyp182 commented 6 years ago

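Suppose play_data = [1, 2] and data_buffer is an empty queue. The result I want from self.data_buffer.extend(play_data) is data_buffer = [1, 2] (where 1 and 2 each stand for one training sample [s, matrix, z]). But won't the actual result be data_buffer = [[1, 2]]? Shouldn't each item in play_data be taken out and appended to data_buffer one by one?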

junxiaosong commented 6 years ago

You could settle this with a quick experiment in Python, couldn't you?

huyp182 commented 6 years ago

Uh, I got my earlier experiment wrong. That was naive of me.
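
For anyone landing on this thread later, the quick experiment suggested above might look like the sketch below. It assumes data_buffer is a collections.deque (the question only calls it a "queue"); the maxlen value and the string placeholders standing in for training samples are made up for illustration.

```python
from collections import deque

# Hypothetical placeholders; in the question each item stands for one
# training sample of the form [s, matrix, z].
play_data = ["sample_1", "sample_2"]

# extend() iterates over its argument and adds each element individually,
# so the buffer ends up flat.
extended = deque(maxlen=10000)  # maxlen is an arbitrary illustrative value
extended.extend(play_data)

# append() adds its argument as a single element, which is what would
# produce the nested result the question was worried about.
appended = deque(maxlen=10000)
appended.append(play_data)

print(list(extended))  # ['sample_1', 'sample_2']
print(list(appended))  # [['sample_1', 'sample_2']]
```

So extend already does what the question proposed doing by hand: it is equivalent to looping over play_data and calling append on each item. The same holds for a plain list.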