atlanhq / camelot

Camelot: PDF Table Extraction for Humans
https://camelot-py.readthedocs.io
Other
3.61k stars 349 forks source link

Same data fetched multiple times of a single table (Data Duplication) #364

Closed bansal95 closed 4 years ago

bansal95 commented 4 years ago

I have created a generic bank_statement _analyzer using camelot. During this I got an issue as my bank statement fetched 2 time. URL : https://firebasestorage.googleapis.com/v0/b/production-payme-india/o/other_document%2Ff60798eb7132f6e51ee68746c5bdf6a16838ec05f8ca2aff1f766e06602e99f1?alt=media

camelot.read_pdf(filename, pages='all', flavor="stream")

Result-URL : https://firebasestorage.googleapis.com/v0/b/production-payme-india/o/other_document%2F5c4350bc084c6bf7da2f4b69bcbabe9edb57ce6796e6c53867cee37e3db95d3a?alt=media

vinayak-mehta commented 4 years ago

Already opened here https://github.com/camelot-dev/camelot/issues/15