Is the LayoutLMv3 model capable of predicting nested bounding boxes?

Hey there, I am using the LayoutLMv3 model for my project. I created a dataset in which nearly all images contain nested bounding boxes, and I tried to fine-tune LayoutLMv3 on it in COCO format (17 classes, more than 10L+, and 60k+ images), but I'm not getting good AP. So my question is: is the LayoutLMv3 model powerful enough to predict nested bounding boxes? If yes, can anyone please explain how, or share a link to an example (with nested bounding box prediction)? @wolfshow Thank you

microsoft/unilm, issue #945