RyotaUshio / obsidian-pdf-plus

The most Obsidian-native PDF annotation, viewing & editing tool ever. Comes with optional Vim keybindings.
https://ryotaushio.github.io/obsidian-pdf-plus/
MIT License
471 stars 11 forks source link

[FR] OCR #126

Open iris2jane opened 3 months ago

iris2jane commented 3 months ago

Describe your request

For many scanned PDF documents, batch OCR may not provide accurate recognition for document layout. As a result, I have to use rectangular selection combined with OCR.

Could you please add a command line option to export the selected links as images to a specified path, making it convenient to use the OCR functionality of other software?

If the plugin could directly support OCR integration, it would be even better.

Thank you for your hard work on this project!

RyotaUshio commented 3 months ago

Could you please add a command line option to export the selected links as images to a specified path, making it convenient to use the OCR functionality of other software?

I guess it's the same request as #65.

You can already specify the path from the Obsidian core's "default location for new attachments" setting, but I agree it will be nice to have more options than that.

If the plugin could directly support OCR integration, it would be even better.

I started playing around with some OCR libraries. One day it may come true.

RyotaUshio commented 3 months ago

@iris2jane By the way, what OCR tool do you use? I've just tried OCRmyPDF and it's impressive!

RyotaUshio commented 3 months ago

I wrote a simple plugin for calling OCRmyPDF from within Obsidian: https://github.com/RyotaUshio/obsidian-pdf-ocr

iris2jane commented 3 months ago

@iris2jane By the way, what OCR tool do you use? I've just tried OCRmyPDF and it's impressive!

I use Umi-OCR, which is an open-source software with active updates.

I wrote a simple plugin for calling OCRmyPDF from within Obsidian: https://github.com/RyotaUshio/obsidian-pdf-ocr

Thanks for your prompt and helpful responses!

EcceMaterTua commented 1 month ago

Not os, but PDF24 has been working great for me. It's free.