Closed NaumanHSA closed 1 year ago
1、suggest you read the original paper or some blogs 2、yes, if the effect of detection doesn't work well , you can train/finetune the model on your own dataset 3、you can refer to the num of xfund_zh dataset, and it's hmean:https://github.com/PaddlePaddle/PaddleOCR/blob/release%2F2.6/doc/doc_en/algorithm_kie_layoutxlm_en.md
Thank you @an1018. Yes, I'm doing some research on these models to understand them fully.
Hi,
I'm new to PaddleOCR and want to train RE model on my custom dataset. I've annotated around 50 images using Label Studio and parsed them according to the PaddleOCR documentation. I set the ML backend in Label Studio to PPOCR engine for text detection and recognition.
In my custom dataset, the question-answer pairs are very close to each other e.g.
Name: ABC
for which the PaddleOCR engine creates only one box. I had to adjust and create another box to make separate boxes for questions and answers. Also, some text wouldn't recognize correctly (mostly spaces wouldn't be detected). My questions are:Thank you all in advance.