khoj-ai / khoj

Your AI second brain. Get answers to your questions, whether they be online or in your own notes. Use online AI models (e.g gpt4) or private, local LLMs (e.g llama3). Self-host locally or use our cloud instance. Access from Obsidian, Emacs, Desktop app, Web or Whatsapp.
https://khoj.dev
GNU Affero General Public License v3.0
12.64k stars 640 forks source link

Include filename, page of PDF results in Emacs for better context #497

Open agzam opened 11 months ago

agzam commented 11 months ago

I primarily use Khoj from Emacs. It appears that when searching using "Content Type = all", the 'Khoj Search' buffer gets populated with excerpts from PDF sources. However, it does not specify the file, page, or location these excerpts are drawn from. When searching only in PDF files the search results do not indicate the page and location either, but at least the filename is present. When you search mixing the content type, the results are somewhat not very useful.

debanjum commented 10 months ago

Thanks for pointing that out. We haven't tested the PDF rendering in Emacs extensively but not being able to see the context of the PDF results (like filename, page number etc.) isn't great. Let us look into fixing that soon