jrmuizel / pdf-extract

A Rust library for extracting content from PDFs
396 stars 78 forks

Handle characters without unicode information #9

Open jrmuizel opened 6 years ago

jrmuizel commented 6 years ago

Sometimes a font doesn't give us the information needed to turn its glyphs into Unicode. We can probably improve the situation by storing a mapping from a hash of the glyph data to the Unicode value, so that a glyph we have seen before can still be resolved.
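
A rough sketch of what such a table might look like, to make the idea concrete. Everything here is an assumption rather than existing pdf-extract code: `GlyphTable`, `learn`, and `lookup` are hypothetical names, the glyph data is treated as an opaque byte slice (for example, a raw outline or charstring), and FNV-1a is just one possible hash, picked because it is easy to write by hand and gives the same value on every run, which matters if the table is ever saved to disk.

```rust
use std::collections::HashMap;

/// 64-bit FNV-1a hash. Chosen for this sketch because its output is stable
/// across runs and Rust versions, unlike the standard library's default hasher.
fn fnv1a_64(bytes: &[u8]) -> u64 {
    let mut hash: u64 = 0xcbf29ce484222325; // FNV offset basis
    for &b in bytes {
        hash ^= u64::from(b);
        hash = hash.wrapping_mul(0x100000001b3); // FNV prime
    }
    hash
}

/// Hypothetical table mapping a hash of glyph data to a Unicode value.
#[derive(Default)]
struct GlyphTable {
    by_hash: HashMap<u64, char>,
}

impl GlyphTable {
    /// Record that this glyph data corresponds to `ch`, e.g. learned from
    /// a font that did carry Unicode information.
    fn learn(&mut self, glyph_data: &[u8], ch: char) {
        self.by_hash.insert(fnv1a_64(glyph_data), ch);
    }

    /// Look up a glyph from a font that lacks Unicode information.
    /// Returns `None` if this exact glyph data has not been seen.
    fn lookup(&self, glyph_data: &[u8]) -> Option<char> {
        self.by_hash.get(&fnv1a_64(glyph_data)).copied()
    }
}
```

Usage would be along the lines of calling `table.learn(bytes, 'A')` while processing fonts that do have a usable mapping, then `table.lookup(bytes)` as a fallback when one is missing. Note that this only matches glyphs whose data is byte-for-byte identical; whether the same glyph is encoded identically across differently subsetted fonts is an open question this sketch does not address.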