huggingface / pixparse

Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data
11 stars 3 forks source link

Pablo/fix tokens docvqa #23

Open molbap opened 12 months ago

molbap commented 12 months ago

WIP, adding transforms in config as option + reworking some eval tasks + adding mid-resolution architecture.