mozilla / pdf.js

PDF Reader in JavaScript
https://mozilla.github.io/pdf.js/
Apache License 2.0
48.32k stars 9.97k forks source link

Horizontal lines displayed on certain pages of a PDF #6851

Closed scottschluer closed 8 years ago

scottschluer commented 8 years ago

I can only reproduce this with a single PDF (which I can't attach due to HIPAA regulations). The first three or so pages of the PDF look like they were merged from an existing PDF document and render just fine, but the remaining pages were all scanned in. All of those pages display in the PDF.js viewer with horizontal lines across them (see screenshot).

If I view the source PDF in Adobe Acrobat, there are no horizontal lines. I'm using v1.2.107.

screenshot

timvandermeij commented 8 years ago

Does it still occur with the latest version of PDF.js. Try opening the PDF with https://mozilla.github.io/pdf.js/web/viewer.html (Open File button in the toolbar). Without an example PDF, there is unfortunately not much we can do to look into the issue...

scottschluer commented 8 years ago

Yes, it does. I downloaded the latest version during my troubleshooting a couple of days ago. I realize it's not very helpful without the PDF but I figured I'd log it anyways.

timvandermeij commented 8 years ago

All I can think of is that it might be a CCITTFaxStream issue, but there is really no way to tell unless there is an example PDF. Is it also not possible to create a reduced test case (for instance only one of the pages, with any sensitive information removed)?

scottschluer commented 8 years ago

I'll see what I can do. I'll grab a page but I need to get approval to upload it here so it may be a day or two. Stay tuned...

timvandermeij commented 8 years ago

I'm going to close this issue for now, but we will reopen it when a test file becomes available. Without it there appears to be nothing we can do about this.