Thanks for your great work!
I notice you use layernorm for the final features before the classifier and also for the predictions. I think it is quite uncommon in prototype learning (correct me if i am wrong).
Could you please provide some explanation for this? And if removing the two layernorm, will the performance be degraded?
Thanks for your great work! I notice you use layernorm for the final features before the classifier and also for the predictions. I think it is quite uncommon in prototype learning (correct me if i am wrong).
Could you please provide some explanation for this? And if removing the two layernorm, will the performance be degraded?
https://github.com/tfzhou/ProtoSeg/blob/1c4a7784bbce96c06fe72d55255af15e6cf1ca96/lib/models/nets/hrnet.py#L81