karpathy / neuraltalk

NeuralTalk is a Python+numpy project for learning Multimodal Recurrent Neural Networks that describe images with sentences.

Confusion about an equation in the paper #38

Open sunshineatnoon opened 8 years ago

sunshineatnoon commented 8 years ago

Thanks for making the code public. This is great work! This issue is not about the code, but I am a little confused by equation 11 in the paper, since the relevant code is not available.

[equation image]

What does i indicate in the above equation? And does t refer to the index of an image fragment or of a sentence fragment? Also, wouldn't maximizing this term make more sense? I would really appreciate it if you could point out any misunderstanding on my part. Thanks!