Open 1920230345 opened 1 year ago
@1920230345 Hi, we did not conduct an ablation study on this. We suggest empirical experiments for different FasterNet variants, as further incorporating activation functions may increase or decrease the model capacity.
Great work! But why not use activation functions after downsampling convolutions?