kuz / DeepMind-Atari-Deep-Q-Learner

The original code from the DeepMind article + my tweaks
http://www.nature.com/nature/journal/v518/n7540/full/nature14236.html
1.83k stars 532 forks source link

question about the code......... #29

Closed foxnudt closed 7 years ago

foxnudt commented 7 years ago

network:forward(s2):float():max(2) some one can explain this function to me? the input s2 means the state ,but what does the '2' in max(2) mean? and dose the network:forward(s2):float():min(2) exist?