Open RicoJYang opened 3 weeks ago
Considering the difference between the OCR results you obtained and the ones we use, this performance drop is relatively reasonable. We encourage you to experiment on the DoclayNet dataset to avoid inconsistencies in OCR acquisition.
In doc_multi_modal.py file:
During training, the print statement prints the corresponding text. Does it mean that the training is progressing normally? I performed OCR operation on the m6doc dataset using paddleocr and converted it using ocr_anno_convert.py. Why did I only get 68.1 mAP when using dino-4scale_w_m2doc_r101_m6doc_36epoch.py training?