In the original Squeeze-and-Excitation Networks, the author found through experiments that when reduction = 16, precision and complexity are well balanced, and in your module, reduction is set to the current number of bneck output channels , what is the reason for this setting? Did it come from the experiment?
In the original Squeeze-and-Excitation Networks, the author found through experiments that when reduction = 16, precision and complexity are well balanced, and in your module, reduction is set to the current number of bneck output channels , what is the reason for this setting? Did it come from the experiment?