Open shubhdotai opened 7 months ago
You need to use layout recognizer. Please look into code in rag/app. May this help.
Thanks for following
Layout recogniser only returns the layout (bounding box and corresponding label). However it doesn't return the text data in that box. Any direct function or code for that?
This function is for this purpose.
Describe your problem
If there is a pdf with 2 columns with headings and tables. I want to extract the text/OCR result separately for individual layout segments. How can I do it directly just by using deepdoc?