plazi / GoldenGATE-Imagine

A GUI Tool For Freeing Text and Data from PDF Documents
Other
5 stars 0 forks source link

chinese font decoding issue FFC0FFFBFFFDAF75905FFFC6F917FFC6 #23

Open myrmoteras opened 2 years ago

myrmoteras commented 2 years ago

https://treatment.plazi.org/id/03F98783FFF9AF7090D7F8F2F824F84E in this treatment there are chinese symbols that do not decode well:

image image

gsautter commented 2 years ago

This is a bit of a question of what we want ... the fonts used in our decoder (Liberation Fonts, which are free) cannot render Chinese symbols, nor can either one of us read them (Jeremy might) ... in a sense, if something does decode to a Unicode symbol in the Chinese range, it is more often an obfuscated font or decoder error than an actual Chinese symbol ...

You can still enter these symbols in the font editor, though, but they are highly likely to render as placeholder question marks, indicating the font used for rendering doesn't have a glyph.