NielsRogge / Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.
MIT License
9.16k stars 1.42k forks

DONUT fine-tuning with missing value in key-value pairs information extraction #319

Open giulio-derasmo opened 1 year ago

giulio-derasmo commented 1 year ago

Hi all, I'm trying to fine-tune the DONUT model on my own dataset for Key Information Extraction. My parsing dictionary is similar to SROIE's, but every sample has at least one key with a missing value. Is it better to train with a ground truth like: { 'key1': 'nan', 'key2': 'value2', 'key3': 'value3', 'key4': 'nan'} or just {'key2': 'value2', 'key3': 'value3'}?
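To make the two options concrete, here is a minimal sketch of how each ground-truth variant could be built from the same partially filled record. The names `ALL_KEYS` and `build_gt`, and the `'nan'` placeholder string, are only illustrative assumptions; they are not part of DONUT or the Transformers library, and the sketch does not say which variant trains better.

```python
# Hypothetical full key set for the dataset (assumption for illustration).
ALL_KEYS = ["key1", "key2", "key3", "key4"]


def build_gt(record, keep_missing, placeholder="nan"):
    """Build a ground-truth dict from a partially filled record."""
    if keep_missing:
        # Option 1: every key is present; missing or empty values
        # are replaced by a placeholder string.
        return {k: record.get(k) or placeholder for k in ALL_KEYS}
    # Option 2: keep only the keys that actually have a value.
    return {k: record[k] for k in ALL_KEYS if record.get(k)}


# A sample where key1 and key4 have no value.
record = {"key2": "value2", "key3": "value3"}

with_placeholders = build_gt(record, keep_missing=True)
without_missing = build_gt(record, keep_missing=False)

print(with_placeholders)
print(without_missing)
```

With option 1 the target sequence always has the same set of keys; with option 2 its length varies from sample to sample.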

Thanks for the answers!