Open ryan-minato opened 3 months ago
cc @ArthurZucker @gante
Hey @ryan-minato 👋
Thank you for opening this issue! We're pausing new rope scaling contributions for a week or so, while we refactor the code. Past that, we'd love to get a contribution 🤗
Will be fixed by #31999
@ryan-minato #31999 will include longrope 🤗
Feature request
Microsoft has introduced their microsoft/LongRoPE implementation. Unlike plug-and-play solutions, LongRoPE requires hyperparameter tuning via a genetic algorithm. This implementation is likely the same as the Su rope scaling on Phi-3. Are there any plans to incorporate LongRoPE into LLaMA?
Motivation
In my research on long content, I have managed to integrate LongRoPE into LLaMA with some minor code adjustments. I am curious whether Hugging Face is also working on integrating this feature.
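For context, here is a rough sketch of the kind of adjustment involved, not the actual patch. It assumes the LongRoPE-style change amounts to dividing each standard RoPE inverse frequency by a per-dimension rescale factor. The function names and the factor values are hypothetical placeholders. Real factors would come from the search procedure mentioned above, which is omitted here, as is any attention scaling.

```python
import math

def rope_inv_freq(head_dim, base=10000.0):
    """Standard RoPE inverse frequencies, one per pair of head dimensions."""
    return [base ** (-2.0 * i / head_dim) for i in range(head_dim // 2)]

def rescaled_inv_freq(head_dim, rescale_factors, base=10000.0):
    """Divide each inverse frequency by its own rescale factor.

    `rescale_factors` is assumed to hold one positive number per
    dimension pair (illustrative values only, not searched ones).
    """
    inv_freq = rope_inv_freq(head_dim, base)
    if len(rescale_factors) != len(inv_freq):
        raise ValueError("need one rescale factor per dimension pair")
    return [f / s for f, s in zip(inv_freq, rescale_factors)]

def rotation_angles(position, inv_freq):
    """Rotation angle for each dimension pair at a given token position."""
    return [position * f for f in inv_freq]

if __name__ == "__main__":
    # Placeholder factors: leave high-frequency pairs alone, stretch the rest.
    factors = [1.0, 1.0, 2.0, 4.0]
    inv_freq = rescaled_inv_freq(8, factors)
    print(rotation_angles(1000, inv_freq))
```

A factor of 1.0 leaves a dimension pair unchanged, while larger factors slow its rotation so that longer positions map onto angles closer to the range seen in training.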
Your contribution
If necessary.