Hi Phil,
I'm trying to use DiT for Document Parsing(Key Information Extraction, KIE, from ID Cards ) since it seems to be a much lighter alternative to DONUT and LayoutLM. Is my premise correct? Also couldn't find a fine-tuned checkpoint of DiT for KIE. Did you work on this? Can you direct me to resources since there seems to be not much work on DiT for this task.
Hi Phil, I'm trying to use DiT for Document Parsing(Key Information Extraction, KIE, from ID Cards ) since it seems to be a much lighter alternative to DONUT and LayoutLM. Is my premise correct? Also couldn't find a fine-tuned checkpoint of DiT for KIE. Did you work on this? Can you direct me to resources since there seems to be not much work on DiT for this task.