philschmid / document-ai-transformers

MIT License
317 stars 47 forks

DiT for Document Parsing (KIE) #16

Open rm-asif-amin opened 7 months ago

rm-asif-amin commented 7 months ago

Hi Phil, I'm trying to use DiT for document parsing (Key Information Extraction, KIE, from ID cards), since it seems to be a much lighter alternative to DONUT and LayoutLM. Is my premise correct? I also couldn't find a fine-tuned DiT checkpoint for KIE. Have you worked on this? Could you point me to any resources? There seems to be very little work on DiT for this task.