Closed Michelvl92 closed 2 years ago
Furthermore, they also placed a CBL (Conv-Bn-Leakyrelu) before SPP and a C3 module after SPP. This looks similar on #6006 FPN_conv. And I have seen this architecture in many other papers, or was this previously the default yoloV5 architecture?
@Michelvl92 anything with LeakyReLU is old. We started YOLOv5 with this activation in v1.0 and migrated away to hardswish and then later Swish.
Residual connections are pretty basic, we already have several of them as you can see in the Concat layers. You're free to add more if you'd like. Concat modules will concat any layers with matching HW dimensions regardless of channel count. If you want the connections to bring similar amounts of information then they should naturally contain similar channel counts though.
👋 Hello, this issue has been automatically marked as stale because it has not had recent activity. Please note it will be closed if no further activity occurs.
Access additional YOLOv5 🚀 resources:
Access additional Ultralytics ⚡ resources:
Feel free to inform us of any other issues you discover or feature requests that come to mind in the future. Pull Requests (PRs) are also always welcomed!
Thank you for your contributions to YOLOv5 🚀 and Vision AI ⭐!
Search before asking
Question
According to Improving YOLOv5 with Attention Mechanism for Detecting Boulders from Planetary Images they add up to 4 connections between the backbone and neck as show in the fig below (forget that they add a 4th detection output).
They claim the following: four connections represented by the red lines are added to bring the feature information from the backbone network (152 × 152 pixels, 76 × 76 pixels, 38 × 38 pixels, 19 × 19 pixels) into the feature fusion layers in the neck network. Based on the idea of residual networks, these connections can enhance the backpropagation of gradients, avoid gradient fading, and reduce the loss of the feature information of small objects.
What are your thoughts on this improvement?
If you think this can be an improvement is the below YAML file correct? Os should I add been channel reduction (CONV 1x1) or set the channel outputs bigger?
Additional