Hello. Thanks for your work on image paragraph captioning and thanks for sharing the code.
In table 1, you report the xe results with repetition penality achieves better scores. So do you initialize the self-critical training from the xe model trained with the repetition penalty or without it? In your code, the xe training (_forward function) is without the repetition penalty, its only in self-critical training (when using _sample function. That means you initialize the self-critical training with the xe model trained without repetition penalty.
Hello. Thanks for your work on image paragraph captioning and thanks for sharing the code. In table 1, you report the xe results with repetition penality achieves better scores. So do you initialize the self-critical training from the xe model trained with the repetition penalty or without it? In your code, the xe training (
_forward
function) is without the repetition penalty, its only in self-critical training (when using_sample
function. That means you initialize the self-critical training with the xe model trained without repetition penalty.