CSU-YKF / MSFT-YOLO

Implemented some industrial product surface defect detection using improved yolov5.
MIT License
5 stars 1 forks source link

关于PatchEmbed和SwinTransformer_Layer #3

Open Salary-only-17k opened 4 months ago

Salary-only-17k commented 4 months ago

还有个问题,关于PatchEmbed和SwinTransformer_Layer。我没看到代码。

Rvosuke commented 3 months ago

swin transformer使用了shifted window加以vision transformer,会涉及新的一种patch embadding 以及相对位置编码,可以参考其原始论文自行添加实现。很抱歉我们并没有在mstf项目中完成其代码。

Liu, Z., Lin, Y., Cao, Y., Hu, H., Wei, Y., Zhang, Z., ... & Guo, B. (2021). Swin transformer: Hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF international conference on computer vision (pp. 10012-10022).