Pablo/cruller docvqa - Githubissues

huggingface / pixparse

Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data


Pablo/cruller docvqa #18

Closed molbap closed 1 year ago

molbap commented 1 year ago

Still WIP: figuring out why ANLS performance is lower with shorter training. It might come from the DistributedSampler being given local_rank in place of global_rank? Otherwise, this PR adds:

- a DocVQA fine-tuning task, starting from a checkpoint that was pretrained on IDL or not
- a DocVQA eval task on the val set (the test set can only be evaluated through their website)
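
To illustrate the local_rank vs. global_rank suspicion above, here is a minimal pure-Python sketch. It is not code from this PR: `shard_indices` and `coverage` are hypothetical helpers, and the sketch only mimics the strided split that `torch.utils.data.DistributedSampler` applies (without shuffling or padding). It assumes a multi-node run; on a single node local and global rank coincide, so this would not explain the ANLS gap there.

```python
def shard_indices(dataset_len, num_replicas, rank):
    """Strided split mimicking DistributedSampler (no shuffle, no padding)."""
    return list(range(dataset_len))[rank:dataset_len:num_replicas]


def coverage(dataset_len, nodes, gpus_per_node, use_global_rank):
    """Fraction of the dataset seen by at least one process per epoch."""
    world_size = nodes * gpus_per_node
    seen = set()
    for node in range(nodes):
        for local_rank in range(gpus_per_node):
            global_rank = node * gpus_per_node + local_rank
            # The suspected bug: passing local_rank where global_rank is expected.
            rank = global_rank if use_global_rank else local_rank
            seen.update(shard_indices(dataset_len, world_size, rank))
    return len(seen) / dataset_len


# Hypothetical setup: 2 nodes with 2 GPUs each, 8 samples.
print(coverage(8, nodes=2, gpus_per_node=2, use_global_rank=True))   # 1.0
print(coverage(8, nodes=2, gpus_per_node=2, use_global_rank=False))  # 0.5
```

With local ranks, both nodes draw the same two shards, so half of the samples are never seen in an epoch. If that were happening here, a shorter training run would effectively see even less data than intended.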