sumatrapdfreader / sumatrapdf

SumatraPDF reader
http://www.sumatrapdfreader.org
GNU General Public License v3.0
13.43k stars 1.71k forks source link

Text selection problem with some documents #2013

Open marcoscmonteiro opened 3 years ago

marcoscmonteiro commented 3 years ago

Text selection does not work well with some documents. I'm attaching one which we can observe malfunction. This problem occurs with 3.2 version and 3.3 pre release. I'm attaching an image and PDF document with problem. In other PDF viewers (e.g. Adobe Reader) this problem does not occurs.

image DOS FATOS E FUNDAMENTOS JURÍDICOS DOS PEDIDOS.pdf

GitHubRulesOK commented 3 years ago

Without opening it, Looks like it was run through an OCR app here it is in Edge image Ok it does not say OCR but it certainly looks corrupted by reprinting (that's a very old version of PDF-Creator) perhaps needs to be resaved using PDFCreator-4.3

@kjk Garbled in Tracker eXchange and Edge I would say that MuPDF would declare this an abomination not worth fixing?

marcoscmonteiro commented 3 years ago

In fact mupdf-gl.exe (1.18.0) with same problem:

image

GitHubRulesOK commented 1 year ago

@kjk My view on this one is the file is corrupt in all viewers and editors thus its unfixable by MuPDF
Although Acrobat has a reasonable alignment of characters they are garbled there too image