Closed Muennighoff closed 3 years ago
This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.
Please note that issues that do not follow the contributing guidelines are likely to be ignored.
i have the same exact problem when i use do_sample=True
can you re-open this issue?
Maybe @gante has an idea!
Hi there @Muennighoff @zeke-john 👋
I've run the script above for both models on v4.5.1
(and on v4.22.dev0
) and it works with no problems -- you can see a colab here.
A potential cause for errors may be GPU memory -- generation with num_beams
is memory intensive. Let me know if you have more details about your problem :)
When using "sshleifer/distilbart-cnn-6-6" & do_sample the below code errors out, meanwhile the same code works for "sshleifer/distilbart-xsum-6-6". Am I missing something really obvious here? Thanks for any help!
Tranformers: 4.5.1