POSTECH-CVLab / point-transformer

This is an unofficial implementation of the Point Transformer paper.

Question about positional embedding #52

Closed: zwbai closed this issue 2 years ago

zwbai commented 2 years ago

Hi,

Thanks for sharing the repo. I am quite confused about this line in the positional embedding: `p_r = layer(p_r.transpose(1, 2).contiguous()).transpose(1, 2).contiguous() if i == 1 else layer(p_r)`

I don't know why the input of the first layer is the transpose(1, 2) of p_r. Since the size of p_r should be (n, nsample, 3), why not feed it directly into the embedding block?
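For context, that line sits inside the positional-encoding MLP. As far as I can tell it looks roughly like the sketch below; the layer sizes and the `out_planes` value are placeholders, not the repo's exact code:

```python
import torch
import torch.nn as nn

out_planes = 64  # placeholder, not the repo's actual setting
linear_p = nn.Sequential(
    nn.Linear(3, 3),
    nn.BatchNorm1d(3),        # i == 1: the layer that gets the transposed input
    nn.ReLU(inplace=True),
    nn.Linear(3, out_planes),
)

p_r = torch.randn(16, 8, 3)   # (n, nsample, 3) relative coordinates
for i, layer in enumerate(linear_p):
    p_r = layer(p_r.transpose(1, 2).contiguous()).transpose(1, 2).contiguous() if i == 1 else layer(p_r)
print(p_r.shape)              # torch.Size([16, 8, 64])
```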

Thanks,

chrockey commented 2 years ago

Hi @zwbai,

Sorry for the late reply.

> I don't know why the input of the first layer is the transpose(1, 2) of p_r

First of all, `if i == 1` refers not to the first layer but to the second one, since `i` starts from 0. So your question becomes: "why is the input of the second layer (in this case, nn.BatchNorm1d) the transpose(1, 2) of p_r?". This is simply how nn.BatchNorm1d works: according to the documentation, its expected input shape is (N, C, L), not (N, L, C).
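Here is a minimal, standalone check of that layout requirement (illustrative shapes only, not code from this repo):

```python
import torch
import torch.nn as nn

n, nsample = 16, 8
p_r = torch.randn(n, nsample, 3)   # (N, L, C): last dim is the 3 coordinate channels
bn = nn.BatchNorm1d(3)             # normalizes over the channel dimension C

# BatchNorm1d expects (N, C, L), so transpose in and back out:
out = bn(p_r.transpose(1, 2).contiguous()).transpose(1, 2).contiguous()
print(out.shape)                   # torch.Size([16, 8, 3])

# Feeding (N, L, C) directly makes BatchNorm1d treat nsample as channels:
try:
    bn(p_r)
except RuntimeError as e:
    print("direct call fails:", e)
```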

Hope this helps your understanding :).

Regards,