Open HansolEom opened 4 years ago
my model has 57.1m parameters on a d0 basis with 1601 classes and 401 attributes
dataset VG , network efficientdet-d0, batch_size 2
The paper has provided a deceptive number. If you manually calculate the total number of trainable parameters for d0, it is 10.2 Million parameters (backbone + biFpn + heads).
The paper has only provided the parameters (3.9M) for biFpn + heads.
The paper doesn't mention this explicitly.
@dvlshah That is actually incorrect. See my newer issue on convolutions where I found that the paper is including the backbone (minus the classifier layer) and bifpn+ heads in its parameter calculations but that the convolutions in the bifpn+heads need to be depthwise separable (as explicitly stated in the paper for bifpn and implied heads) and you need to properly set Dclass and WClass which are set incorrectly in the current code.
From what I read in the paper, the effectivedet-d0 standard was 3.9m. But your model comes in 10.2m on a d0 basis. Did I calculate the parameters correctly?
python train.py --dataset COCO --dataset_root ~/data/coco/ --network efficientdet-d0 --batch_size 32
Loaded pretrained weights for efficientnet-b0 10261154