holtwick / pdfify

Issue tracking for PDFify. To upvote features give a 👍
https://pdfify.app/future?ref=github&kw=start
12 stars 0 forks source link

incorrect text-extraction #66

Closed karstenBriksoft closed 2 years ago

karstenBriksoft commented 3 years ago

Please describe how the error can be reproduced: I tried to convert the text from https://www.instagram.com/p/CAU_UGQiSZU/ (first text-picture from the 1080w source). When copying the text to clipboard, the umlauts are missing and one line reads PfannevordemGebrauchbeiStufe7erhitzen.DasBrot (no spaces)

App Version Info: de.holtwick.mac.PDFify@3.3.2+130 84f018b5947244feaf21fe750b52647e

holtwick commented 3 years ago

Oh, this looks so tasty! :)

It works for me with Tesseract and language set to German. What are your settings?

karstenBriksoft commented 3 years ago

My settings were English, duh. After downloading German (loaded the precise-version) and activating it, the umlauts are correct, but there's still this one long word.

holtwick commented 2 years ago

Indeed, I'll fix it with the next release:

20220308-172516-capture-holtwick@2x

holtwick commented 2 years ago

Is fixed in the next release. You may want to test it in the latest beta pdfify.app/help#beta