Questions on scaling images and GT masks

In data_loader.py, in the pull_item function:

        region_scores = self.resizeGt(region_scores)
        affinity_scores = self.resizeGt(affinity_scores)
        confidence_mask = self.resizeGt(confidence_mask)

and the function definition of resizeGt is:

    def resizeGt(self, gtmask):
        return cv2.resize(gtmask, (self.target_size // 2, self.target_size // 2))

Why do you resize the score maps and the confidence mask to half the target size?

In the same function, you perform element-wise division on region_scores and affinity_scores:

    region_scores_torch = torch.from_numpy(region_scores / 255).float()
    affinity_scores_torch = torch.from_numpy(affinity_scores / 255).float()

Why divide by 255?
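
As far as I can tell, the division only rescales the values from the 0-255 range to 0-1. A minimal sketch of what I mean, with made-up values (the array contents and dtype are my assumptions, not taken from the repository):

```python
import numpy as np

# Made-up score map with values in the 0-255 range (an assumption about
# what region_scores holds before the division).
region_scores = np.array([[0, 51, 255]], dtype=np.float32)

# Element-wise division, as in pull_item.
normalized = region_scores / 255
print(normalized.min(), normalized.max())
```

If that is the only purpose, is the ground truth expected to be in [0, 1] by the loss?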

random_scale uses self.target_size as the minimum dimension size and 1280 as the maximum. This means the scaled image and character boxes can end up anywhere between self.target_size and 1280. So what happens if the scaled image is larger than 768? How do you guarantee that it will be 768? You don't seem to rescale the image after random_scale.
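
To make the question concrete, here is a minimal sketch of the behaviour I am describing. The function random_scale_dims is hypothetical and is not the repository's code: it assumes the shorter side is scaled to a random length between target_size and 1280, and that target_size is 768.

```python
import random

def random_scale_dims(height, width, target_size=768, max_size=1280, rng=None):
    """Hypothetical stand-in for random_scale: pick a random length in
    [target_size, max_size] for the shorter side and scale both sides
    by the same factor."""
    rng = rng or random.Random()
    short_side = rng.randint(target_size, max_size)
    scale = short_side / min(height, width)
    return round(height * scale), round(width * scale)

rng = random.Random(0)
for _ in range(3):
    h, w = random_scale_dims(1000, 1500, rng=rng)
    # The shorter side is at least target_size but is usually larger,
    # so some later step would have to bring the image down to 768 x 768.
    print(h, w, min(h, w) >= 768)
```

Under these assumptions the output is almost never exactly 768 on either side, which is why I am asking where the final size is enforced.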

backtime92 / CRAFT-Reimplementation, issue #39