microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
https://aka.ms/GeneralAI
MIT License
20.2k stars 2.55k forks source link

What tool to use to annotate the image data for training LayoutLMv2 model? #664

Open karndeepsingh opened 2 years ago

karndeepsingh commented 2 years ago

Hi, I have financial image documents and want to extract few entities from the image documents like Name, property address, page number etc. And want to use LayoutLMv2 model. For this reason, I need to prepare dataset for training the model hence I am unable to find a proper tool to annotate the dataset and prepare according to LayoutLMv2. Please help me to choose the right tool to annotate and prepare the data for fine tuning.

Thanks

wolfshow commented 2 years ago

This might help. https://github.com/microsoft/OCR-Form-Tools

karndeepsingh commented 2 years ago

This might help. https://github.com/microsoft/OCR-Form-Tools

@wolfshow Thanks for sharing. Just one more questions, Does it saves annotation in LayoutLM required format? Or need to convert those annotation into the desired format? If so then is their any script available to convert these annotation?

Thanks

seanbenhur commented 2 years ago

Did you try out the above tool? @karndeepsingh

Rajeshwar21 commented 2 years ago

Did you try out the above tool? @karndeepsingh

Have u found an open source tool for this task ? labeling layoutlm

karndeepsingh commented 2 years ago

Did you try out the above tool? @karndeepsingh

Have u found an open source tool for this task ? labeling layoutlm

I used paid tool called UBIAI to annotate for LayoutLm

vcjayan commented 2 years ago

Did you try out the above tool? @karndeepsingh

Have u found an open source tool for this task ? labeling layoutlm

I used paid tool called UBIAI to annotate for LayoutLm

Hi @karndeepsingh how much it costs for paid version? I have around 1K invoices to be trained Is the annotation done online or can the tool be installed in local machine like LabelStudio ? Thanks

karndeepsingh commented 2 years ago

Did you try out the above tool? @karndeepsingh

Have u found an open source tool for this task ? labeling layoutlm

I used paid tool called UBIAI to annotate for LayoutLm

Hi @karndeepsingh how much it costs for paid version? I have around 1K invoices to be trained Is the annotation done online or can the tool be installed in local machine like LabelStudio ? Thanks

Hi @vcjayan, You can mail your queries in the following email id : admin@ubiai.tools

Hope it helps.

Thanks, Karndeep Singh