emesterhazy / glossika-to-anki

Convert Glossika PDFs and audio files into Anki decks
MIT License
32 stars 8 forks source link

Questions about extracting from the non-text pdfs #9

Closed exrulez closed 4 years ago

exrulez commented 4 years ago

Hello emesterhazy!

I used your program to convert my cantonese pdfs into anki, and I was blown away by the efficiency your script did this. I was doing it manually until I had the thought of searching on google, and you were the first solution I found. Such finesse and precision. It was amazing! I was trying to convert my zh pdfs as well but it seems like they are not text. I looked all over, but I can't seem to find find the text versions of these that are compatible with your software, would you happen to have them?

I'm trying to create a hybrid deck with zh-canto-eng, so i can do 2 langs at once, and your script would really help with this, but I can't seem to find the zh text pdfs anywhere.

Thank you so much for your hard work!

emesterhazy commented 4 years ago

Do you know which version of the PDFs you have? It's possible that there's copy protection enabled on your PDF which is preventing the script from reading the text. If you're able to highlight text in the PDF this is probably the issue. If you think this is the problem the good news is that there are lots of programs floating around that can fix this issue for your PDF. Let me know if that works.

emesterhazy commented 4 years ago

Closing this since there's been no activity for a month. Feel free to re-open if you're still having issues.