DeepRNN / image_captioning

TensorFlow implementation of "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"
MIT License
781 stars, 351 forks

How to get the logit of attention? #44

Open fei161 opened 5 years ago

fei161 commented 5 years ago

In lines 404 to 415 of the model.py file, why do you add the logit computed from the image features to the logit computed from the hidden state to get the final logit? Why not directly multiply the image features and the hidden state to get the final logit?
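To make the two options in this question concrete, here is a minimal NumPy sketch of additive (MLP-style) scoring next to multiplicative (bilinear) scoring. It is not the code from model.py: every name, shape, and weight below (`feats`, `h`, `W_a`, `W_h`, `v`, `W`) is a hypothetical placeholder chosen for illustration.

```python
import numpy as np

def softmax(x):
    # Numerically stable softmax over a 1-D vector of logits.
    e = np.exp(x - np.max(x))
    return e / e.sum()

def additive_logits(feats, h, W_a, W_h, v):
    # Additive scoring: project image features and hidden state separately,
    # sum the projections, apply tanh, then reduce to one logit per region.
    # feats: (L, D), h: (H,), W_a: (D, K), W_h: (H, K), v: (K,)
    return np.tanh(feats @ W_a + h @ W_h) @ v  # shape (L,)

def multiplicative_logits(feats, h, W):
    # Multiplicative (bilinear) scoring: one logit per region from feats W h.
    # W: (D, H) is needed whenever D != H; a plain dot product requires D == H.
    return feats @ W @ h  # shape (L,)

# Assumed toy sizes: L regions, D feature dims, H hidden dims, K projection dims.
rng = np.random.default_rng(0)
L, D, H, K = 6, 8, 5, 4
feats = rng.standard_normal((L, D))
h = rng.standard_normal(H)
W_a = rng.standard_normal((D, K)) * 0.1
W_h = rng.standard_normal((H, K)) * 0.1
v = rng.standard_normal(K)
W = rng.standard_normal((D, H)) * 0.1

alpha_add = softmax(additive_logits(feats, h, W_a, W_h, v))
alpha_mul = softmax(multiplicative_logits(feats, h, W))

# Either set of weights yields a context vector as a weighted sum of regions.
context_add = alpha_add @ feats  # shape (D,)
context_mul = alpha_mul @ feats  # shape (D,)
```

Both variants produce one logit per image region and a valid attention distribution after the softmax. They differ in how the hidden state interacts with the features: through a sum of projections in the first case, and through a product in the second.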

Also, why don't you convert the weighted image features into the cell state c, and concatenate it with the word embedding as input?