microsoft / LongRoPE

LongRoPE is a novel method that can extends the context window of pre-trained LLMs to an impressive 2048k tokens.
MIT License
100 stars 10 forks source link