Layout-Parser / layout-model-training

The scripts for training Detectron2-based Layout Models on popular layout analysis datasets

Need suggestion on label mapping #6

Open SAIVENKATARAJU opened 2 years ago

SAIVENKATARAJU commented 2 years ago

Hi, I am currently fine-tuning Layout Parser on my custom dataset. I am using pubLayNet/faster_rcnn_R_50_FPN_3x as my base model, whose output label set is {0: "Text", 1: "Title", 2: "List", 3: "Table", 4: "Figure"}. For my own PDFs, however, I only want the labels "Title", "Section", "Paragraph", "ListItem", "PageNumber", and "Table". My question is: what should the order of the label mapping be? Also, the pre-trained model already detects tables in my custom data quite well, and I don't want to ruin that. Can you please suggest how I should proceed?
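For reference, the two label sets in the question can be written out side by side. This is only a minimal sketch in plain Python, not a confirmed answer from the maintainers: the names `PUBLAYNET_LABELS`, `CUSTOM_CLASSES`, and `build_label_map` are hypothetical, and the order of the custom classes is an arbitrary choice made for illustration. It assumes that class ids are expected to be contiguous integers starting at 0, and that whichever order is picked is then used consistently in the training annotations and at inference time.

```python
# Label set of the pre-trained PubLayNet model, as quoted in the question.
PUBLAYNET_LABELS = {0: "Text", 1: "Title", 2: "List", 3: "Table", 4: "Figure"}

# Custom classes from the question. The order is an assumption made for
# illustration; it is not prescribed by the repository.
CUSTOM_CLASSES = ["Title", "Section", "Paragraph", "ListItem", "PageNumber", "Table"]


def build_label_map(classes):
    """Assign contiguous integer ids (0..N-1) to class names in list order."""
    return {idx: name for idx, name in enumerate(classes)}


custom_label_map = build_label_map(CUSTOM_CLASSES)

# Number of classes the fine-tuned model would have to predict.
num_classes = len(custom_label_map)

# Class names that appear in both label sets.
shared = sorted(set(PUBLAYNET_LABELS.values()) & set(CUSTOM_CLASSES))

print(custom_label_map)
print(num_classes, shared)
```

Only "Table" and "Title" are shared between the two sets, and the class count changes from 5 to 6, so the training configuration would presumably need its class count updated to match the custom map.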

kyle-bong commented 2 years ago

Hello. Did you find a solution to this problem? Like you, I want to fine-tune Layout Parser on my custom dataset, and the label categories and the number of labels in my dataset are slightly different from Layout Parser's. If you found a solution, could you tell me how? Thank you!

NixBiks commented 1 year ago

Any updates from any of you here?