huggingface / pixparse

Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data

Pablo/cruller docvqa #18

Closed molbap closed 1 year ago

molbap commented 1 year ago

Still WIP; figuring out why the ANLS performance is lower with shorter training. It might come from the DistributedSampler using local_rank in place of global_rank? Else, adds
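
To make the local_rank vs. global_rank hypothesis concrete, here is a minimal pure-Python sketch of strided sharding in the style of `torch.utils.data.DistributedSampler` (no shuffling, no padding). The helper names `shard_indices` and `coverage`, the 2-node x 2-GPU layout, and the `global_rank = node * gpus_per_node + local_rank` convention are assumptions for illustration, not code from this PR.

```python
def shard_indices(dataset_len, num_replicas, rank):
    """Strided split: rank r takes indices r, r + num_replicas, ...

    Hypothetical helper; assumes dataset_len is divisible by num_replicas,
    so the padding step of a real sampler is not modelled.
    """
    return list(range(dataset_len))[rank:dataset_len:num_replicas]


def coverage(dataset_len, nodes, gpus_per_node, use_global_rank):
    """Fraction of the dataset seen across all processes in one epoch."""
    world_size = nodes * gpus_per_node
    seen = set()
    for node in range(nodes):
        for local_rank in range(gpus_per_node):
            # Assumed layout: ranks are numbered node by node.
            global_rank = node * gpus_per_node + local_rank
            rank = global_rank if use_global_rank else local_rank
            seen.update(shard_indices(dataset_len, world_size, rank))
    return len(seen) / dataset_len


if __name__ == "__main__":
    print(coverage(8, nodes=2, gpus_per_node=2, use_global_rank=True))
    print(coverage(8, nodes=2, gpus_per_node=2, use_global_rank=False))
```

Under these assumptions, passing local_rank on a multi-node run makes processes on different nodes read the same shard, so part of the dataset is never seen in an epoch. On a single node the two ranks coincide, so this would not explain the gap there.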