Closed Erichen911 closed 4 years ago
The position embedding id indicates the position of the patch. It consists of 0 to n and 0 is used because cls_token corresponds to the first position. As shown in the following figure, the image is divided into patches and positions from 1 to n are used.
This is a great git! Thanks a lot
My question is
when embedding the image patches, why the position_embeddings and the cls_token are zeros?