hunkim / ReinforcementZeroToAll

249 stars 132 forks source link

log 0 -> nan problem #16

Closed imcomking closed 7 years ago

imcomking commented 7 years ago

log_lik = -Y * (tf.log(tf.clip_by_value(action_pred, 1e-10 , 1.0)))