Closed ZohrehAdabi closed 3 years ago
Hey, Resnet-50 backbone downsamples the image 8 times. If the input isn't divisible by 8, the output size may end being slightly different from the input size. You can of course discard this and use e.g. scale_factor * W. But you may need to resize the GT map so that it's of the same size as the output (required for computing the training loss).
Hey, Resnet-50 backbone downsamples the image 8 times. If the input isn't divisible by 8, the output size may end being slightly different from the input size. You can of course discard this and use e.g. scale_factor * W. But you may need to resize the GT map so that it's of the same size as the output (required for computing the training loss).
Thank you.
Hi @Viresh-R, Why should the resized image be divisible by 8? Can I discard this and use e.g. scale_factor * W?