User reported garbled text quoted when selecting content from a specific to annotate.
Confirmed that using PDF.js to select text in document -- both with Hypothesis client and natively in Firefox -- results in garbled output.
Using other PDF viewers (MacOS Preview, Adobe Acrobat, Chrome browser) to select text copies clean and ungarbled.
Running PDF through OCR again fixes issue.
This may be a fluke or extremely rare edge case, but it may be worth investigating why PDF.js behaves differently than other PDF viewers in rendering the text layer.
From user ticket: https://app.hubspot.com/contacts/6291320/ticket/573368454/
User reported garbled text quoted when selecting content from a specific to annotate.
Confirmed that using PDF.js to select text in document -- both with Hypothesis client and natively in Firefox -- results in garbled output.
Using other PDF viewers (MacOS Preview, Adobe Acrobat, Chrome browser) to select text copies clean and ungarbled.
Running PDF through OCR again fixes issue.
This may be a fluke or extremely rare edge case, but it may be worth investigating why PDF.js behaves differently than other PDF viewers in rendering the text layer.
Example PDF (Hypothesis staff only): https://drive.google.com/file/d/1qG9Ea5D3lVGNUrhsLCn_fB9BpX8MFnoN/view?usp=sharing