Closed fg-mindee closed 2 years ago
cf. #536
We still need to switch the predictor to active expansion.
Is that still an issue with #731?
I was reviewing this issue actually, and I'm not 100% sure. I assume it is, but to be safe we need to test whether `expand=True` works once #731 is merged :+1:
@charlesmindee according to the last commits we just made on the main branch, I believe this isn't an issue anymore. Can you confirm, so we can close the last 0.5.0 issue related to rotation? :smiley:
I am not sure I get your point here: `expand=True` is used only in the rotation transformation, because there we have to rotate the page, and it works well in this transformation at training time. At inference time, however, the predictor doesn't rotate pages, since we now have a rotation-aware detection model. To be sure, I checked everywhere in the predictors, and it is impossible to pass the kwarg `expand`. @fg-mindee
Could you specify whether we don't need `expand=True` at inference time in the predictor because the predictor behaviour & flags are already taking care of this? In the first case, I believe we can close this issue :+1:
ping @charlesmindee (I think it's the first option, but I want to be sure before closing the issue)
Just to confirm @charlesmindee, we can close this right?
ping @charlesmindee :pray:
@charlesmindee ?
Yes sorry we can close that!
In our current OCR predictor, there is a potential issue with rotation at the detection stage:
- `expand=False` in rotations: this means that a segment at the very top-right end of the image, if rotated by 45°, would go out of bounds and would not become a box candidate. Here we miss potential detections.
- `expand=True`: some extra care needs to be taken to rotate and reproject bounding boxes relative to the initial image (and with relative coords). As of now, making this switch would most likely cause a lot of problems.

This will require some heavy discussions to handle the matter properly in the long term.
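
To make the two cases above concrete, here is a minimal NumPy sketch of the geometry. It is not the predictor's actual code: `rotate_points` and `reproject_points` are hypothetical helpers written only for illustration, and they assume relative `(x, y)` coordinates, a rotation around the image centre, and a y axis pointing down.

```python
import math

import numpy as np


def _rotation_matrix(angle_deg):
    # Counter-clockwise rotation as seen on screen (image y axis points down).
    theta = math.radians(angle_deg)
    cos, sin = math.cos(theta), math.sin(theta)
    return np.array([[cos, sin], [-sin, cos]])


def rotate_points(points, angle_deg, img_shape, expand=False):
    """Hypothetical helper: rotate relative (x, y) points around the image centre.

    Returns the relative coords in the output canvas and that canvas' (h, w).
    """
    h, w = img_shape
    rot = _rotation_matrix(angle_deg)
    # Relative -> absolute coords, centred on the middle of the image
    pts = np.asarray(points, dtype=float) * (w, h) - (w / 2, h / 2)
    pts = pts @ rot.T
    if expand:
        # The canvas grows so that the whole rotated image fits inside it
        new_w = abs(w * rot[0, 0]) + abs(h * rot[0, 1])
        new_h = abs(w * rot[0, 1]) + abs(h * rot[0, 0])
    else:
        new_w, new_h = w, h
    # Back to relative coords in the (possibly larger) output canvas
    return (pts + (new_w / 2, new_h / 2)) / (new_w, new_h), (new_h, new_w)


def reproject_points(points, angle_deg, rotated_shape, orig_shape):
    """Hypothetical helper: map points from the expanded canvas back to the initial image."""
    rh, rw = rotated_shape
    oh, ow = orig_shape
    rot = _rotation_matrix(-angle_deg)
    pts = np.asarray(points, dtype=float) * (rw, rh) - (rw / 2, rh / 2)
    pts = pts @ rot.T
    return (pts + (ow / 2, oh / 2)) / (ow, oh)


top_right = [[1.0, 0.0]]  # relative (x, y) of the top-right corner

# expand=False: the rotated corner gets a negative y, i.e. it leaves the canvas
clipped, _ = rotate_points(top_right, 45, (100, 100), expand=False)

# expand=True: the corner stays inside the canvas, but the canvas is now larger
kept, new_shape = rotate_points(top_right, 45, (100, 100), expand=True)

# Relative coords from the expanded canvas must be mapped back to the initial image
restored = reproject_points(kept, 45, new_shape, (100, 100))
```

The extra care mentioned for `expand=True` is the last step: coordinates predicted on the expanded canvas are relative to a different width and height, so they cannot be reused as-is on the initial image.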