unidoc / unipdf

Golang PDF library for creating and processing PDF files (pure go)
https://unidoc.io
Other
2.48k stars 250 forks source link

[BUG] Unexpected output from extractor.ExtractText() in v3.17 #438

Closed compleatang closed 3 years ago

compleatang commented 3 years ago

Upgrading to 3.17 and seeing a regression in one of my tests.

expected: "[end]"
actual  : "\x1c\x04\x03\x12\x1d"

The tests reads the last page of a PDF, runs ExtractText() and matches the end of the extracted string. Test passes fine on 3.13, .14, .15, and .16 with no changes other than go.mod.

github-actions[bot] commented 3 years ago

Welcome! Thanks for posting your first issue. The way things work here is that while customer issues are prioritized, other issues go into our backlog where they are assessed and fitted into the roadmap when suitable. If you need to get this done, consider buying a license which also enables you to use it in your commercial products. More information can be found on https://unidoc.io/

gunnsth commented 3 years ago

@compleatang Please share your test case and files. Otherwise we cannot reproduce this.

compleatang commented 3 years ago

FWIW, 3.18 works fine, as does 3.16, 15, 14, 13, and 12. I'm not sure what happened with 3.17 and haven't had time to isolate out for your my test case from our test suite, but it's all fine with 3.18 so I'm going to close the issue.