BobLd / tabula-sharp

Extract tables from PDF files (port of tabula-java)
MIT License
159 stars 26 forks source link

Investigating [BUG] - Stream: Area detection hangs on PDF page #30 #33

Open mikelor opened 3 months ago

mikelor commented 3 months ago

Investigating hangs on detecting regions in the SimpleNurminenAlgorithm.

See sample project mikelor/tabulate for sample PDFs and code.

Issue seems to be an infinite loop in the Detect Method here

        // removed following line. It's unclear how this code exit's the loop. When a table is found,
        // there is nothing to advance to the next set of criteria, so a table will always be found.
        // } while (foundTable);