Closed kekekawaii2839 closed 11 months ago
Hi @kekekawaii2839 , Thank you for your interest in our work.
I think we made a mistake in the command line,
can you please try removing the flag src/configs/model/bart_base_sled.json
?
Also, since you are training with long context, it will make the most sense to also test with long inputs by adding --test_unlimiformer --eval_max_source_length 999999
Let me know how it goes. Best, Uri
Great! It works! Thanks for your wonderful work again!
Hi, I tried to train with inputs longer than 1024 on bart using the following command:
And I got a lot of error like this:
But as long as
max_source_length
is smaller than 1024, I can train the model successfully. Any clues on that?