Open csanadpoda opened 1 year ago
@csanadpoda Can you share your labeling interface with us? That will help us to determine if this is a Label Studio issue, or a configuration issue.
So I've done some digging, and the thing is it only happens in a specific case. I have Tesseract OCR as my Machine Learning engine wired in via label-studio-ml-backend, and when you add a new label it tries to read the text from within the rectangle. It's all working fine, EXCEPT if you delete something after you've labeled it, that's when the issue arises. So by default if you label an empty space, it puts in a Form Feed character (and gets denoted as \f in the transcription of the JSON_MIN export), BUT if you delete a recognized label's text value, then you also delete the Form Feed character, and it defaults back to the "Recognized Text" placeholder.
So basically these fields are fine:
as even though they look empty, they contain the Form Feed character, however if you go in and delete the content, it regresses back to this:
and any fields that look like this will then have a bounding box in the transcription.
So for example the line for the first image in transcriptions would be:
...
"transcription": [
"\f"
]
...
the one for the second tag would be:
"transcription": [
{
"x": 39.68099960967038,
"y": 34.65627214741319,
"width": 8.313567362428842,
"height": 1.2048192771084274,
"rotation": 0,
"text": [],
"original_width": 2480,
"original_height": 3505
}
]
But since it's empty on the labeling interface, I'd expect it to be an empty string, not a bounding box dictionary.
When exporting labels in the JSON_MIN format, sometimes bounding box information is added to the transcriptions instead of the transcriptions themselves.
To Reproduce Steps to reproduce the behavior:
Expected behavior I'd expect JSON_MIN to have the following format:
Environment (please complete the following information):
Additional context I wonder if this is user error, or what may cause it, as it's impacting my data transformation scripts. I'm not expecting bbox information among my transcriptions. Also, it seems like the bboxes mostly happen when there's no actual transcription added, just empty text. But empty text is also important information in my use case.