alibaba / AliceMind

ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab
Apache License 2.0
1.98k stars 291 forks source link

Few shot learning for Document AI #68

Closed SimJeg closed 1 year ago

SimJeg commented 1 year ago

Hello,

I am working on a practical use-case of Document understanding and wondering if I could leverage models such as StructuralLM. The goal is to extract key informations from the document (in fields or tables). The trick is that I only have a few training samples (<50) and I don't think VQA would apply as these informations are very specific and not always associated with a clear question.

Here are the 2 options I have in mind :

Many thanks for your inputs, Simon

lcl6679292 commented 1 year ago

Thank you for your attention. I suggest that you first use the model of public data training, such as DocVQA, to fine-tune your few training samples.