masterkni6 closed this 2 years ago
OK. I've updated it using `MultiHeadRelativePositionalEmbedding` from beit.
But something still doesn't seem to match, as my parameter counts fall short of what the paper reports:
| Model | Self-defined params | Paper-reported params |
|---|---|---|
| CoAtNet0 | 23.86M | 25M |
| CoAtNet1 | 41.05M | 42M |
| CoAtNet2 | 72.67M | 75M |
| CoAtNet3 | 162.46M | 168M |
| CoAtNet4 | 270.90M | 275M |
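
To make the size of the mismatch easier to see, here is a small sketch that computes the absolute and relative gap for each row of the table above. The numbers are copied from the table; the helper name `gap` is just for illustration, and since the paper values look rounded to whole millions, the percentages are only approximate.

```python
# Parameter counts in millions: (self-defined, paper-reported), copied from the table.
counts = {
    "CoAtNet0": (23.86, 25.0),
    "CoAtNet1": (41.05, 42.0),
    "CoAtNet2": (72.67, 75.0),
    "CoAtNet3": (162.46, 168.0),
    "CoAtNet4": (270.90, 275.0),
}


def gap(self_defined, reported):
    """Return (missing params in millions, missing params as % of the reported count)."""
    missing = reported - self_defined
    return missing, 100.0 * missing / reported


gaps = {name: gap(*pair) for name, pair in counts.items()}

for name, (missing, pct) in gaps.items():
    print(f"{name}: {missing:.2f}M short ({pct:.1f}% of reported)")
```

Every variant comes out below the reported figure, by roughly 1M to 5.5M parameters.
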
Your coatnet seems to create only one set of bias weights, whereas the paper says one set per head.
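
For a rough sense of how much this changes the count, here is a minimal sketch. It assumes a 2D relative position bias table with one entry per relative offset, i.e. `(2H - 1) * (2W - 1)` entries for an `H x W` feature map, either shared across heads or repeated once per head. The function name `relative_bias_params`, the 14x14 map, and the 8 heads are hypothetical values chosen for illustration, not taken from the repo or the paper.

```python
def relative_bias_params(height, width, num_heads, per_head=True):
    """Count learnable relative position bias entries for one attention layer.

    Assumes one bias value per 2D relative offset, giving a table of
    (2*height - 1) * (2*width - 1) entries, either shared by all heads
    or duplicated once per head.
    """
    table_size = (2 * height - 1) * (2 * width - 1)
    return table_size * (num_heads if per_head else 1)


# Hypothetical layer: 14x14 feature map, 8 attention heads.
shared = relative_bias_params(14, 14, num_heads=8, per_head=False)
per_head = relative_bias_params(14, 14, num_heads=8, per_head=True)

print(shared)             # 729
print(per_head)           # 5832
print(per_head - shared)  # 5103 extra parameters for this one layer
```

Summing this difference over every attention block in a model would show how much of the gap the per-head bias accounts for.
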