Use of detected cell bounding box for cell text extraction

DevashishPrasad / CascadeTabNet

This repository contains the code and implementation details of the CascadeTabNet paper "CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents"

MIT License

1.46k stars 422 forks source link

Use of detected cell bounding box for cell text extraction #128

Closed Atul997 closed 3 years ago

Atul997 commented 3 years ago

In processing of borderless tables detected cells are used in calculating rows and columns but not in the extraction of text in cells. Can you explain the reason ?

Vibha111094 commented 3 years ago

I also had a similar question. Are we supposed to further perform ocr on the cell? Any advice on this will be very helpful

AyanGadpal commented 3 years ago

Hi @Atul997 and @Vibha111094, Yes, We are providing you with the Text Coordinates and their position in Table. If you want the text, you will need to use OCR on the cells.