game playing with CNN - Githubissues

alanyuchenhou commented 8 years ago

Paper: Better Computer Go Player with Neural Network and Long-term Prediction

Scenario: predict following moves in go

Given the game history Predict the next k moves

Implementation: CNN

The CNN reads a heavily hand engineered 19x19x25 D vector (with current, historical game situation and also the opponent information encoded) and predict next k future moves at the same time

Highlights

k outputs generated concurrently instead of sequentially, useful for training, not predicting
no pooling layers
no weight constraint(no filter replications?)
the problem is substantially harder than similar image recognition problems (very high sensitivity)
1st convolutional layer is not trained, but manually generated
no well defined input layer
history is handled in a very unique way
recurrent net didn't perform well
optional search engine
this paper has too many elements
Questions
does the CNN treat next k moves as independent?
will it be better to use recurrent net to handle game history instead of hand-engineering a feature map?
will it be better to remove all hand engineered features?

ghost commented 8 years ago

Might also be interested in this deep learning approach to playing arcade games: http://www.nature.com/nature/journal/v518/n7540/full/nature14236.html The same group at Google released Tensor Flow as a tool for this kind of work. Might be worth a look. See https://www.tensorflow.org/.

alanyuchenhou commented 8 years ago

Paper: Human-level control through deep reinforcement learning

Scenario: predict actions in a arcade game

Given current game situation Predict optimum actions

Implementation: CNN

This paper is too comprehensive while the implementation details of the reinforcement learning and prediction is not clearly described.

Highlights

experience replay: randomize the ordering of inputs in the sequence to remove false correlations between inputs
iterative update: adjust action-values towards target values to reduce false correlations with target

alanyuchenhou / elephant

game playing with CNN #10

Paper: Better Computer Go Player with Neural Network and Long-term Prediction

Scenario: predict following moves in go

Implementation: CNN

Highlights

Questions

Paper: Human-level control through deep reinforcement learning

Scenario: predict actions in a arcade game

Implementation: CNN

Highlights