scambier / obsidian-text-extractor

A (companion) plugin to facilitate the extraction of text from images (OCR) and PDFs.
GNU General Public License v3.0
349 stars, 19 forks

Text Extraction Dependency #36

Closed: khesed closed this issue 1 year ago

khesed commented 1 year ago

I'm unclear on how the Obsidian plugin system works with regard to dependencies, but I'm curious what your thoughts are on using a general library like textract for text extraction?

scambier commented 1 year ago

There are different approaches regarding dependencies, but basically anything goes.

The philosophy I have with Text Extractor is to keep it self-contained, so dependencies must run in JS (either pure JS or wasm). Some plugins like Obsidian OCR need external dependencies to be installed on the host computer. That's also how the textract library works.
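
To make the distinction above concrete, here is a minimal sketch of how one might classify candidate extraction backends by where they run. Everything in it is hypothetical and for illustration only: the `Runtime` and `ExtractionBackend` types, the helper functions, and the backend names are assumptions, not part of Text Extractor, Obsidian, or textract.

```typescript
// Hypothetical model for illustration; none of these names come from a real API.
type Runtime = "pure-js" | "wasm" | "host-binary";

interface ExtractionBackend {
  name: string;
  runtime: Runtime;
}

// A backend counts as self-contained when it runs entirely inside the JS engine,
// either as plain JavaScript or as a WebAssembly module shipped with the plugin.
function isSelfContained(backend: ExtractionBackend): boolean {
  return backend.runtime === "pure-js" || backend.runtime === "wasm";
}

// Keep only the backends that could be bundled with a self-contained plugin.
function selectBundleable(backends: ExtractionBackend[]): ExtractionBackend[] {
  return backends.filter(isSelfContained);
}

// Made-up candidates: two run in the JS engine, one shells out to a host program.
const candidates: ExtractionBackend[] = [
  { name: "wasm-ocr-engine", runtime: "wasm" },
  { name: "js-pdf-parser", runtime: "pure-js" },
  { name: "system-cli-wrapper", runtime: "host-binary" },
];

const bundleable: string[] = selectBundleable(candidates).map((b) => b.name);
console.log(bundleable);
```

Under this model, a library that relies on tools installed on the host computer would fall into the `host-binary` category and be filtered out, which is the reason given above for not adopting it.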