Open: AncientRemember opened this issue 3 years ago
Yeah, you are right, but I'm just being faithful to the original implementation.
Can you drop the link to the paper with the better positional encoding you are referring to?
The link is updated.
Reference: "Learning to Encode Position for Transformer with Continuous Dynamical Model" (ICML 2020).
The relative position embedding used by BoTNet is not inductive, which limits generalization:
it does not allow inference on an image size different from the one used during training.
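
To make the size limitation concrete, here is a minimal NumPy sketch. It is not BoTNet's actual code: `rel_logits_1d`, the table layout, and the sizes are assumptions chosen for illustration. It models a learned relative position table along one axis with one row per offset seen at training time, so a longer axis at inference asks for offsets that have no learned row.

```python
import numpy as np

def rel_logits_1d(q, rel_table):
    """Position logits along one axis from a learned relative-offset table.

    q:         (n, d) queries along the axis.
    rel_table: (2 * n_train - 1, d) table, one row per offset in
               [-(n_train - 1), n_train - 1]. Hypothetical layout.
    """
    n = q.shape[0]
    center = (rel_table.shape[0] - 1) // 2      # row holding offset 0
    pos = np.arange(n)
    idx = pos[None, :] - pos[:, None] + center  # (n, n) table row per pair
    if idx.min() < 0 or idx.max() >= rel_table.shape[0]:
        raise ValueError(
            f"axis length {n} needs offsets the table never learned "
            f"(table covers axis length {center + 1})"
        )
    # Dot each query with the embedding of its offset to every key.
    return np.einsum("id,ijd->ij", q, rel_table[idx])

rng = np.random.default_rng(0)
n_train, d = 4, 8
table = rng.normal(size=(2 * n_train - 1, d))

# Same axis length as training: every offset has a learned row.
print(rel_logits_1d(rng.normal(size=(n_train, d)), table).shape)

# Longer axis at inference: some offsets have no row, so this raises.
try:
    rel_logits_1d(rng.normal(size=(6, d)), table)
except ValueError as err:
    print("failed:", err)
```

In this simplified layout a shorter axis still works, because its offsets are a subset of the learned ones. Implementations that shape the embedding to the exact training feature map would be stricter than this sketch.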