microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
https://aka.ms/GeneralAI
MIT License

How to perform inference on a single image using fine-tuned LayoutLMv3 model? #1526

Open laminggg opened 2 months ago

laminggg commented 2 months ago

I have fine-tuned a LayoutLMv3 model and now want to use it for layout analysis and information extraction on a single image. Training succeeded, but I'm having difficulty with the inference step.
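For reference, here is a minimal sketch of single-image inference for the information extraction case. It makes several assumptions that the post does not confirm: the model was fine-tuned for token classification and saved in Hugging Face `transformers` format, the processor files were saved in the same directory, and the words and pixel-space boxes come from an OCR engine you run yourself. The names `predict_single_image`, `normalize_box`, `model_dir`, `words`, and `boxes` are hypothetical. Layout analysis models fine-tuned with the detectron2-based code in this repository load differently and are not covered here.

```python
from typing import List, Sequence, Tuple


def normalize_box(box: Sequence[float], width: int, height: int) -> List[int]:
    """Scale a pixel-space (x0, y0, x1, y1) box to the 0-1000 range LayoutLMv3 expects."""
    x0, y0, x1, y1 = box
    return [
        int(1000 * x0 / width),
        int(1000 * y0 / height),
        int(1000 * x1 / width),
        int(1000 * y1 / height),
    ]


def predict_single_image(
    image_path: str,
    model_dir: str,
    words: List[str],
    boxes: List[Sequence[float]],
) -> List[Tuple[str, str]]:
    """Return (word, predicted_label) pairs for one image.

    Assumption: model_dir holds a token-classification checkpoint plus its
    processor files; words/boxes come from your own OCR step.
    """
    # Heavy dependencies are imported here so the helper above stays usable alone.
    import torch
    from PIL import Image
    from transformers import LayoutLMv3ForTokenClassification, LayoutLMv3Processor

    image = Image.open(image_path).convert("RGB")
    width, height = image.size

    # apply_ocr=False because we pass our own words and boxes.
    processor = LayoutLMv3Processor.from_pretrained(model_dir, apply_ocr=False)
    model = LayoutLMv3ForTokenClassification.from_pretrained(model_dir)
    model.eval()

    norm_boxes = [normalize_box(b, width, height) for b in boxes]
    encoding = processor(
        image, words, boxes=norm_boxes, truncation=True, return_tensors="pt"
    )

    with torch.no_grad():
        logits = model(**encoding).logits
    pred_ids = logits.argmax(-1).squeeze(0).tolist()

    # Keep the prediction of the first sub-token of each word.
    results, seen = [], set()
    for token_idx, word_idx in enumerate(encoding.word_ids(0)):
        if word_idx is None or word_idx in seen:
            continue
        seen.add(word_idx)
        results.append((words[word_idx], model.config.id2label[pred_ids[token_idx]]))
    return results
```

If the processor was not saved with the checkpoint, loading it from the base model the checkpoint was fine-tuned from is a common fallback. Documents longer than the model's maximum sequence length are truncated in this sketch, so words past that limit get no prediction.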