kweonwooj / papers

summary of ML papers I've read
318 stars 34 forks source link

An Actor-Critic Algorithm for Sequence Prediction #60

Open kweonwooj opened 6 years ago

kweonwooj commented 6 years ago

Abstract

Details

Personal Thoughts

Link : https://arxiv.org/pdf/1607.07086.pdf Authors : Bahdanau et al. 2017

ritujpatel commented 4 years ago

In our environment we have only sequential observation and have to generate action in sequence. How to train the the same model when we don't have output labels/true action?