scambier / obsidian-omnisearch

A search engine that "just works" for Obsidian. Supports OCR and PDF indexing.
GNU General Public License v3.0
1.14k stars 57 forks source link

[BUG] Obsidian Crashing While Indexing PDF With Omnisearch #316

Open JamesIslan opened 9 months ago

JamesIslan commented 9 months ago

Hello! I'm trying to use Omnisearch to index some of my OCR PDF files inside Obsidian. However, i'm facing some problems with it, as the amount of files is big. I noticed that the obsidian.exe process reach 4GB on the Task Manager, and after that the program crashes and forces me to reload. When i do that, the plugin automatically disables the file cache indexing. I really need the cache to be enabled, as the indexing without it would take a really long time on every program opening. I would like to know if there's a way to limit the RAM usage of the Obsidian process to be always below 4GB, so that the crash stop occurring. I've already tried to index the PDF files one by one, but as some of the files have big size (~100MB), this workaround isn't enough, and after some files are indexed, the crash occur anyway. Any solution or hint would be really appreciating!

image

Things to try:

scambier commented 9 months ago

I think that unfortunately, you're hitting the limits of Omnisearch here. The cache is simply too large, and disables itself to avoid bootloop crashes. I can only recommend to not index your PDFs, or place the biggest ones in an excluded folder 🤷‍♂️

I'll leave this open for now and see if a fix is possible in Minisearch.

scambier commented 3 months ago

@JamesIslan I have updated the cache loading mechanism in the latest Omnisearch update (1.23). Could you please check and report if your issue still persists?