Open RuABraun opened 3 years ago
hello, did you make any progress?
Yes but I did my own implementation but only got a small improvement. Not sure if it's because of a mistake from me or because of some mismatch between my training data and the evaluation data.
What is your question?
In this issue https://github.com/pytorch/fairseq/issues/3080 alex talks about rescoring of nbest lists with a transformer LM. How can one do this with fairseq?
When doing fused decoding for me the results are worse with normal sized beams and when I try to use very large ones like 1000 I get OOM (I have 64G).
I have looked around in the fairseq and flashlight repository and have not been able to find anything for rescoring.