unidoc / unipdf

Golang PDF library for creating and processing PDF files (pure go)
https://unidoc.io

[BUG] Cannot extract text from created report with composite font #322

Closed gunnsth closed 3 years ago

gunnsth commented 4 years ago

Description

A problem was reported via email:

A report is created with a composite (Japanese) font using the creator package. The text extractor is unable to extract the text back correctly.

Expected Behavior

Expected to be able to extract the text that was added to the report.

Actual Behavior

The text extracted from the generated PDF does not match the text that was added to the report.

Steps to reproduce the behavior:

  1. Run japanese2.go, which produces the PDF
  2. Run text extraction on the PDF
  3. See that the text is not extracted correctly

Attachments

Code for producing the report has been provided.