Hi there,
have you tried using the RoFormer model for text generation? I want to use it because it captures the relative position of each word.
If you have tried it, do I have to change anything other than loading a different model? Right now my generation is much worse than with BERT, which is counterintuitive! :)