google-research-datasets / hiertext

The HierText dataset contains ~12k images from the Open Images dataset v6 with a large number of text entities. We provide word-, line-, and paragraph-level annotations.
Creative Commons Attribution Share Alike 4.0 International

"vertices" fields in the paragraphs and lines level #9

Closed. Asafgendler closed this issue 1 year ago

Asafgendler commented 1 year ago

Hello, and thanks again for the great dataset.

I wanted to ask when you use the "vertices" fields at the paragraph and line levels.

In your paper and repo, you describe the line- and paragraph-level bounding polygons as the union of the polygons of the words they contain, so it made me wonder whether their own "vertices" fields are ever used.

Jyouhou commented 1 year ago

It's not used in the paper. The unified detector is a pure segmentation model.
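
For illustration only, here is a minimal pure-Python sketch of what "union of word polygons" could look like when building a line-level mask without touching the line's own "vertices". It is not the code used in the paper. The key names `words` and `vertices` are assumed to match the annotation layout, and `point_in_polygon`, `rasterize`, `line_mask_from_words`, and `example_line` are hypothetical names made up for this example.

```python
import math
from typing import Dict, List, Set, Tuple

Pixel = Tuple[int, int]


def point_in_polygon(x: float, y: float, polygon: List[List[float]]) -> bool:
    """Even-odd ray casting test for a point against a simple polygon."""
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        # Only edges that straddle the horizontal line through y can be crossed.
        if (y1 > y) != (y2 > y):
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def rasterize(polygon: List[List[float]]) -> Set[Pixel]:
    """Return the pixels whose centers fall inside the polygon."""
    xs = [p[0] for p in polygon]
    ys = [p[1] for p in polygon]
    pixels: Set[Pixel] = set()
    for py in range(math.floor(min(ys)), math.ceil(max(ys))):
        for px in range(math.floor(min(xs)), math.ceil(max(xs))):
            if point_in_polygon(px + 0.5, py + 0.5, polygon):
                pixels.add((px, py))
    return pixels


def line_mask_from_words(line: Dict) -> Set[Pixel]:
    """Union of the word masks; the line's own "vertices" is ignored."""
    mask: Set[Pixel] = set()
    for word in line["words"]:  # assumed key name
        mask |= rasterize(word["vertices"])
    return mask


# Toy line with two words separated by a two-pixel gap (made-up coordinates).
example_line = {
    "vertices": [[0, 0], [9, 0], [9, 2], [0, 2]],
    "words": [
        {"vertices": [[0, 0], [4, 0], [4, 2], [0, 2]]},
        {"vertices": [[6, 0], [9, 0], [9, 2], [6, 2]]},
    ],
}

union_mask = line_mask_from_words(example_line)
polygon_mask = rasterize(example_line["vertices"])
# The union leaves the gap between the words empty; the line polygon fills it.
print(len(union_mask), len(polygon_mask))
```

In this toy case the word union covers fewer pixels than the line polygon, because the gap between the two words is left out. That is the practical difference between the two representations.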

Asafgendler commented 1 year ago

Thank you very much