Closed Fateeeeee closed 3 years ago
Is there any principle to follow?
I have not done relevant experiments on large models. A possible way is to add CAs after the last 1x1 conv in each building block as done in SENet.
Is there any principle to follow?