microsoft / LongRoPE

LongRoPE is a novel method that can extends the context window of pre-trained LLMs to an impressive 2048k tokens.
MIT License
100 stars 10 forks source link

why target_ids is input_ids' clone? #10

Closed momandai closed 3 months ago

momandai commented 3 months ago

image