eole-nlp / eole

Open language modeling toolkit based on PyTorch
https://eole-nlp.github.io/eole
MIT License

fixed mismatch between mask and batch dimensions #6

Closed l-k-11235 closed 4 months ago

l-k-11235 commented 4 months ago

The 'zero-out-prompt-loss' option is broken because of a mismatch between the mask and the tgt side of the batch. I have tested the fix on a simple example:

Bonjour les amis ### Response: ⦅newline⦆bonjour !

I printed the following in the ignore_prompt method of the LossCompute class:

# mask 
tensor([[0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1, 1]], device='cuda:0')
# batch["tgt"] before masking
tensor([[ 82682,   3626,  87893,  17011,  94768,     26,    721,    189,   6099,
          30363,    759, 128002]], device='cuda:0')
# batch["tgt"] after masking
tensor([[   189,    189,    189,    189,    189,    189,    189,    189,   6099,
          30363,    759, 128002]], device='cuda:0')
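The tensors above show the intended behavior: every prompt position (mask value 0) is overwritten with the padding token, which appears to be id 189 here, so the loss skips the prompt and only the response tokens survive. A minimal sketch of that masking step, assuming a simplified standalone function rather than the actual LossCompute.ignore_prompt signature:

```python
import torch

def ignore_prompt(tgt: torch.Tensor, mask: torch.Tensor, padding_idx: int) -> torch.Tensor:
    # Replace prompt positions (mask == 0) with the padding index so the
    # loss computation ignores them; response positions (mask == 1) keep
    # their original token ids.
    return tgt.masked_fill(mask == 0, padding_idx)

# Values taken from the printed tensors above (padding_idx=189 is an
# assumption inferred from the "after masking" output).
tgt = torch.tensor([[82682, 3626, 87893, 17011, 94768, 26, 721, 189,
                     6099, 30363, 759, 128002]])
mask = torch.tensor([[0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1, 1]])

masked = ignore_prompt(tgt, mask, padding_idx=189)
print(masked)
# tensor([[   189,    189,    189,    189,    189,    189,    189,    189,   6099,
#          30363,    759, 128002]])
```

The key point of the fix is that mask and tgt must share the same shape (here both (1, 12)) before masked_fill is applied; a dimension mismatch between the two is what broke the option.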