WorldBrain / Memex

Browser extension to curate, annotate, and discuss the most valuable content and ideas on the web. As individuals, teams and communities.
https://worldbrain.io
4.41k stars 336 forks source link

PDF text is copied without spaces #1222

Open mhrahmani opened 2 years ago

mhrahmani commented 2 years ago

When I open this in the PDF reader to annotate, the text annotated has no spaces.

http://www.imagomundi.com.br/espiritualidade/corbin_inside.pdf

This happens when I copy the text if it's displayed in the Memex PDF reader. The text layer is not "rendered" properly.

At the moment my solution is to copy the text from another reader and paste it as a comment under my annotation so it's indexed properly.

This is more just an FYI, I'm sure there'll be more PDF issues in the future since there's always PDF reading issues; It's like the office printer of file formats.

johnfuller commented 2 years ago

This is likely an issue with pdf.js, which the reader uses. That project is supported by Mozilla. For these sorts of issues, we'll just have to wait for pdf.js developers to fix these problems.

Here is an example of other people having similar issues.

https://github.com/mozilla/pdf.js/issues/10640