microsoft / table-transformer

Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.
MIT License

Mapping of cells output with actual inference image #163

Open akshay-bothra opened 6 months ago

akshay-bothra commented 6 months ago

Hi, I am using the inference pipeline to extract cell objects. The output of detect is cropped and passed as input to recognize, so recognize produces cell objects for the cropped table image. I want to map these cell bboxes back onto the original image that was fed into the detection model. Could you provide some guidance on how to map them to the original input image?
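To make the question concrete, here is a minimal sketch of the mapping being asked about. It is not taken from the repository. It assumes that each cell is a dict with a `bbox` entry in `[x0, y0, x1, y1]` pixel coordinates of the cropped table image, that the crop was a plain axis-aligned crop with no rotation or resizing, and that the top-left corner used for the crop is known (if padding was added around the detected table, that corner is the detected box corner minus the padding). The function name `map_cells_to_original` and the `crop_origin` parameter are hypothetical.

```python
from typing import Dict, List, Sequence


def map_cells_to_original(cells: List[Dict], crop_origin: Sequence[float]) -> List[Dict]:
    """Translate cell boxes from cropped-table coordinates to original-image coordinates.

    Assumption: `crop_origin` is the (x, y) position, in the original image, of the
    top-left corner of the crop that was passed to the recognition step.
    """
    x_off, y_off = crop_origin
    mapped = []
    for cell in cells:
        x0, y0, x1, y1 = cell["bbox"]
        new_cell = dict(cell)  # shallow copy so the input list is left unchanged
        new_cell["bbox"] = [x0 + x_off, y0 + y_off, x1 + x_off, y1 + y_off]
        mapped.append(new_cell)
    return mapped


# Hypothetical example: the table crop started at (100, 200) in the original image.
cells = [{"bbox": [10, 20, 50, 40]}]
mapped = map_cells_to_original(cells, (100, 200))
print(mapped[0]["bbox"])  # [110, 220, 150, 240]
```

If the cropped table was rotated before recognition, the rotation would have to be undone before applying this translation; that case is not covered by the sketch.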