Open 5g4s opened 10 months ago
The paper proposes Rotary Position Embedding (RoPE) to effectively leverage positional information. Specifically, RoPE encodes the absolute position with a rotation matrix and, at the same time, incorporates an explicit relative position dependency into the self-attention formulation. This lets RoPE bring positional information into the learning process.
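To make the "absolute rotation, relative dependency" statement concrete, here is a sketch of the two-dimensional case. The symbols ($q$, $k$ for a query and a key, $m$, $n$ for their positions, $\theta$ for a fixed angle, $R$ for a rotation matrix) are notation chosen for this note, following the linked paper as I understand it:

$$
R(\alpha) = \begin{pmatrix} \cos\alpha & -\sin\alpha \\ \sin\alpha & \cos\alpha \end{pmatrix},
\qquad
\tilde{q}_m = R(m\theta)\, q, \qquad \tilde{k}_n = R(n\theta)\, k
$$

$$
\tilde{q}_m^{\top} \tilde{k}_n
= q^{\top} R(m\theta)^{\top} R(n\theta)\, k
= q^{\top} R\big((n-m)\theta\big)\, k
$$

Each vector is rotated by an angle that depends only on its own absolute position, but the attention score depends only on the offset $n - m$. For a $d$-dimensional vector the same rotation is applied to each of the $d/2$ feature pairs, with a separate angle $\theta_i = 10000^{-2(i-1)/d}$ per pair.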
Absolute positional encoding is not always particularly meaningful, because it is common practice to pack short sentences and phrases together in a single context and to break up sentences across contexts.
Existing methods directly add the position information to the context representations. In contrast, our approach derives the relative position encoding by rotating the context representations according to their positions.
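The rotation described above can be sketched in a few lines of NumPy. This is a minimal illustration, not the authors' reference implementation: the function name `rope`, the choice to pair adjacent features (0, 1), (2, 3), ..., and the `base` parameter are assumptions made for this example.

```python
import numpy as np

def rope(x, positions, base=10000.0):
    """Rotate each adjacent feature pair of x by a position-dependent angle.

    x: array of shape (seq_len, dim), dim must be even.
    positions: sequence of seq_len absolute positions.
    (Hypothetical helper for illustration only.)
    """
    seq_len, dim = x.shape
    assert dim % 2 == 0, "feature dimension must be even"
    # One rotation frequency per feature pair: base^(-2i/dim), i = 0..dim/2-1
    inv_freq = base ** (-np.arange(0, dim, 2) / dim)
    # angles[t, i] = position_t * frequency_i, shape (seq_len, dim/2)
    angles = np.asarray(positions, dtype=np.float64)[:, None] * inv_freq[None, :]
    cos, sin = np.cos(angles), np.sin(angles)
    x_even, x_odd = x[:, 0::2], x[:, 1::2]
    out = np.empty_like(x, dtype=np.float64)
    # Standard 2D rotation applied to every (even, odd) pair
    out[:, 0::2] = x_even * cos - x_odd * sin
    out[:, 1::2] = x_even * sin + x_odd * cos
    return out

rng = np.random.default_rng(0)
q = rng.normal(size=(1, 8))  # one query vector
k = rng.normal(size=(1, 8))  # one key vector

# Same offset (4) at two different absolute locations
score_a = rope(q, [3]) @ rope(k, [7]).T
score_b = rope(q, [103]) @ rope(k, [107]).T
print(np.allclose(score_a, score_b))
```

The two scores agree because only the offset between the query and key positions survives the dot product. With an additive position encoding, shifting both positions by 100 would generally change the score.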
Our method encourages faster convergence.
https://arxiv.org/abs/2104.09864 https://blog.eleuther.ai/rotary-embeddings/