clovaai / donut

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
https://arxiv.org/abs/2111.15664
MIT License
5.53k stars 444 forks source link

Handling Variable Element Presence in Parsing Document Task #248

Open abdelaziz-jaddi opened 10 months ago

abdelaziz-jaddi commented 10 months ago

Hello,

Sometimes, when tackling a document comprehension problem, it can be observed that elements are not always consistently present in documents, as illustrated by this example:

Document 1 (element 1: present, element 2: present, element 3: absent)

Document 2 (element 1: absent, element 2: present, element 3: present)

To prepare the training data, it is possible to add the category "other" when the element is not present in the document ?