Lv1 has mixed training templates

google-research-datasets / vrdu

We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datasets that represent several challenges: rich schema including diverse data types, complex templates, and diversity of layouts within a single document type.

74 stars 5 forks source link

Hi @amitbcp,

Thanks for your interests in our work!

By templates, we mean that documents in the same or similar layout structures. The relative spatial relation on the page should be similar. Figure 4.b is a good example.

As you can tell from the document names, we group the documents according to the form types, e.g. amendment, dissemination report, and short form. Since documents in the same group are the same form, we believe they contain the same contents in a similar structure. We can also see a few variants in the same group (as you pointed out), but we believe such minor difference will not influence the final results greatly.

Thanks!

google-research-datasets / vrdu

Lv1 has mixed training templates #1