Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.
MIT License
2.31k
stars
255
forks
source link
Can table tranformer be used to detect multiple tables in an image? #157
For this image, when i try the table transformer, i get only 1 prediction. Like if i crop the image and then run on individual crops, the results are as expected. But is it possible to pass the whole image and get multiple predictions?
For this image, when i try the table transformer, i get only 1 prediction. Like if i crop the image and then run on individual crops, the results are as expected. But is it possible to pass the whole image and get multiple predictions?
Code used:
Also the results when uploading image on : https://huggingface.co/microsoft/table-transformer-detection, and results when running on local seem to be different. Does anyone know why?