Closed tomdzh closed 3 years ago
One more thing: this behavior still persists after I fine tuned the model on my own dataset
Hi @tomdzh
first of all, unlike the original T5, mT5 is not pre-trained on any supervised downstream task (like summarization, translation etc), so generation work without fine-tuning it.
Also, it would be hard to answer why it's happening in fine-tuned model without looking at any code.
This issue has been automatically marked as stale and been closed because it has not had recent activity. Thank you for your contributions.
If you think this still needs to be addressed please comment on this thread.
Environment info
transformers
version: 4.2.2Who can help
Text Generation: @patrickvonplaten @TevenLeScao T5: @patrickvonplaten
Information
Model I am using (Bert, XLNet ...): MT5
To reproduce
Steps to reproduce the behavior:
The output I got is ['.']
Expected behavior
I tried a few input texts. The generated output always start with, which doesn't happen in t5 generation. Anyone knows how to solve it?