Implement BeamSearch for decoding

elliottd / GroundedTranslation

Multilingual image description

https://staff.fnwi.uva.nl/d.elliott/GroundedTranslation/

BSD 3-Clause "New" or "Revised" License

46 stars 25 forks source link

Implement BeamSearch for decoding #2

Closed elliottd closed 8 years ago

elliottd commented 9 years ago

We currently generate a sentence by taking the argmax prediction at each timestep. However, the highest probability sentence is not always found in the argmax at each timestep. In fact, Karpathy and Fei-Fei (2015) argue that a beam of seven can increase CIDEr from 0.61 to 0.66.

We should implement a beam search decoder, where the size of the beam as a free parameter in the model.