microsoft / table-transformer

Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.
MIT License

Inference #93

Closed (bsmock closed 1 year ago)

bsmock commented 1 year ago

Adds an inference pipeline to detect, recognize, and extract tables from images. Note that for text content to be included in the extracted tables, the words on the page must be provided to the pipeline as additional input alongside the image.
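To illustrate why the words are needed as a separate input, here is a minimal sketch of how word boxes might be matched to recognized cell boxes. It is not the pipeline's actual code: the function names `overlap_fraction` and `fill_cells`, the word format (a dict with `bbox` and `text` keys), and the 0.5 overlap threshold are all assumptions made for this example.

```python
from typing import Dict, List, Sequence

BBox = Sequence[float]  # [xmin, ymin, xmax, ymax] in image coordinates


def overlap_fraction(word_box: BBox, cell_box: BBox) -> float:
    """Return the fraction of the word's area that lies inside the cell."""
    x0 = max(word_box[0], cell_box[0])
    y0 = max(word_box[1], cell_box[1])
    x1 = min(word_box[2], cell_box[2])
    y1 = min(word_box[3], cell_box[3])
    inter = max(0.0, x1 - x0) * max(0.0, y1 - y0)
    area = (word_box[2] - word_box[0]) * (word_box[3] - word_box[1])
    return inter / area if area > 0 else 0.0


def fill_cells(cells: List[BBox], words: List[Dict], min_overlap: float = 0.5) -> List[str]:
    """Assign each word to the cells it mostly overlaps and join the text.

    `cells` stands in for the cell boxes a structure model would predict;
    `words` stands in for OCR or PDF text output (hypothetical format).
    """
    texts = []
    for cell in cells:
        inside = [w for w in words if overlap_fraction(w["bbox"], cell) >= min_overlap]
        # Rough reading order: top-to-bottom, then left-to-right.
        inside.sort(key=lambda w: (w["bbox"][1], w["bbox"][0]))
        texts.append(" ".join(w["text"] for w in inside))
    return texts


if __name__ == "__main__":
    cells = [[0, 0, 50, 20], [50, 0, 100, 20]]
    words = [
        {"bbox": [5, 5, 20, 15], "text": "Total"},
        {"bbox": [22, 5, 45, 15], "text": "cost"},
        {"bbox": [60, 5, 80, 15], "text": "42"},
    ]
    print(fill_cells(cells, words))  # ['Total cost', '42']
    print(fill_cells(cells, []))     # ['', ''] (structure only, no text)
```

With an empty word list the cell structure is still recovered but every cell is blank, which is the behavior the note above describes.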