ydhongHIT / DDRNet

The official implementation of "Deep Dual-resolution Networks for Real-time and Accurate Semantic Segmentation of Road Scenes"
MIT License
414 stars 53 forks source link

Question about crop augmentation during training #16

Closed mbcel closed 3 years ago

mbcel commented 3 years ago

Thank you very much for your great contribution!

I have a question about the crop augmentation that you used during training. In the paper you say that you cropped the cityscapes images to 1024x1024 during training. Considering this how did you inference on the full resolution images (2048x1024) to get the benchmark results? Did you feed in 2 crops (left and right 1024x1024 crop) into the model with same size as during training and merged both into the full image afterwards or did you feed in the full resolution image during inferencing, although training was done with smaller image crops?

ydhongHIT commented 3 years ago

Thank you very much for your great contribution!

I have a question about the crop augmentation that you used during training. In the paper you say that you cropped the cityscapes images to 1024x1024 during training. Considering this how did you inference on the full resolution images (2048x1024) to get the benchmark results? Did you feed in 2 crops (left and right 1024x1024 crop) into the model with same size as during training and merged both into the full image afterwards or did you feed in the full resolution image during inferencing, although training was done with smaller image crops?

Hi, I feed in the full resolution image during inference.

mbcel commented 3 years ago

Thank you for the fast reply!