bitextor / pdf-extract

PDF parser and converter to HTML
GNU General Public License v3.0
83 stars 14 forks source link

Deadlock on stderr from pdftohtml #38

Closed kpu closed 4 years ago

kpu commented 4 years ago

@amirkamran was right, there is a lot of stuff in here that should have just been a shell script.

If pdftohtml produces more stderr than the pipe buffer can handle, the process will deadlock and then hopefully timeout.

https://github.com/bitextor/pdf-extract/blob/4ad28a23817851355ba65b6b4699a8f01b2cb760/src/pdfextract/PDFToHtml.java#L175