Open Rishabh-S1899 opened 3 months ago
Hi, the .net
is the trained model. . net_swa
contains the Stochastic Weight Averaging of the model during training. In some cases, the average prevents overfitting, so I recommend checking both on the task.
We have finetuned the passt_s_swa_p16_s16_128_ap473 model on Dcase 2020 dataset for scene classificiation. Now we are trying to use the finetuned model by loading params from ckpt file using state dictionary. But it says it has two types of params .net and.net_swa. Which params are we supposed to use for the architecture