sacmehta / ESPNet

ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation
https://sacmehta.github.io/ESPNet/
MIT License
538 stars 111 forks source link

ESPnet V2 accuracy #69

Closed HYOJINPARK closed 4 years ago

HYOJINPARK commented 4 years ago

I am really appreciated your code. In your code, the ESPnet v2 recalls the weight of ImgNet pre-trained one. So, I remove the line and train from scratch for CityScape. However, I failed to get the same accuracy with your papers. I also used lr=0.007 (not 0.009 due to your paper) When I run train_segmentation.py, the best IOU is 47% in the log of train_segmention.py. IS the accuracy in your paper training from the pre-train weight?

sacmehta commented 4 years ago

The accuracy metric used in Cityscapes evaluation is different from what we use. You need to test accuracy using Cityscapes scripts.

Which ESPNetv2 model are you trying?

Yes, the models in the below repository uses ImageNet pretrained models. However, for Cityscapes, we didn’t notice much difference if trained from scratch. You need to train longer if you are training from scratch.

sacmehta commented 4 years ago

https://github.com/sacmehta/EdgeNets