yenchenlin / DeepLearningFlappyBird

Flappy Bird hack using Deep Reinforcement Learning (Deep Q-learning).
MIT License
6.65k stars 2.04k forks source link

Why use tf.multiply? #37

Closed guotong1988 closed 7 years ago

guotong1988 commented 7 years ago

https://github.com/yenchenlin/DeepLearningFlappyBird/blob/master/deep_q_network.py#L82-L83

I can not find the math that support the multiply operation.

guotong1988 commented 7 years ago

The Answer: Every action has a Q_value.