Open martinratinaud opened 2 years ago
This is because the ti
is presented as a ligature in this document. However, this ligature is not part of Unicode’s Alphabetic Presentation Forms block (range U+FB0x
for latin scripts), and had been encoded in the document as a (
character: I can reproduce markus
' behaviour by copy-pasting.
The publishing software of that document is Apple Pages.
Is there any way for markus
to recover such an encoding?
Bug Report 🐛
Transform of this specific pdf file https://assets.website-files.com/615dba2b324d4ea51a398f26/622a2175014d39da4f4bf688_2022%2003%2014%20CGU%20Heetch%20France%20CLEAN.pdf leads to weird text transformation.
Steps to Reproduce
Launch
Current Behavior
It seems
ti
is replaced by(
Expected behaviour
Get as close to possible to
Context (Environment)
Desktop