I see the flow is detecting page layout using the rtdetr model, which have label classes from doclaynet along with some additional labels, then use tableformer for table structure, but what exactly you do for reading order? I see some clustering mentioned in the code for page assemble? Can you explain the code and add some example to explain it? Any test or sample code I can follow to understand this better?
Question
...
I see the flow is detecting page layout using the rtdetr model, which have label classes from doclaynet along with some additional labels, then use tableformer for table structure, but what exactly you do for reading order? I see some clustering mentioned in the code for page assemble? Can you explain the code and add some example to explain it? Any test or sample code I can follow to understand this better?