Open dwaydwaydway opened 3 years ago
Hi, I'm Ting-Wei,
I just read your paper and have some confusion. In the experiment for Figure 3, what does None means? If no regularization term is added, the decoding objective should degrade to MAP right? Then why does beam size affect the result?
It only degrades to MAP as the beam size goes to infinity.
Hi, I'm Ting-Wei,
I just read your paper and have some confusion. In the experiment for Figure 3, what does None means? If no regularization term is added, the decoding objective should degrade to MAP right? Then why does beam size affect the result?