decoder layer里的add & norm

MCG-NJU / SparseBEV

[ICCV 2023] SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos

https://arxiv.org/abs/2308.09244

MIT License

306 stars 21 forks source link

Open Azeral0619 opened 1 month ago

Azeral0619 commented 1 month ago

为什么代码里只有layer norm没有add

afterthat97 commented 1 month ago

有吧，比如mixing末尾就有：

self attention也有，在multiHeadAttention的官方实现里面