michaelthwan / searchGPT

Grounded search engine (i.e. with source reference) based on LLM / ChatGPT / OpenAI API. It supports web search, file content search etc.
MIT License
640 stars 65 forks

PDF extractor #45

Open michaelthwan opened 1 year ago

eren23 commented 1 year ago

Server side, or client side with something like Tesseract.js or the pdf-extractor npm package?

michaelthwan commented 1 year ago

@eren23 Thanks for your suggestion 😊 I am planning to use pypdf.
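
For illustration, here is a minimal sketch of what server-side extraction with pypdf could look like. It is not the project's implementation: the function names `join_page_texts` and `extract_pdf_text` are hypothetical, and the only pypdf calls it relies on are `PdfReader`, `reader.pages`, and `page.extract_text()`. Note that pypdf reads a PDF's embedded text layer only; scanned, image-only PDFs would still need OCR (for example the Tesseract-based approach mentioned above).

```python
from typing import Iterable, Optional


def join_page_texts(page_texts: Iterable[Optional[str]]) -> str:
    """Join per-page text into one string, skipping blank pages.

    Hypothetical helper: pages are separated by a blank line so that
    downstream chunking can still see page boundaries.
    """
    cleaned = []
    for text in page_texts:
        # Treat a missing or whitespace-only page as empty.
        stripped = (text or "").strip()
        if stripped:
            cleaned.append(stripped)
    return "\n\n".join(cleaned)


def extract_pdf_text(path: str) -> str:
    """Extract the text layer of a PDF file using pypdf (assumed installed)."""
    # Imported lazily so the helper above works without pypdf.
    from pypdf import PdfReader

    reader = PdfReader(path)
    return join_page_texts(page.extract_text() for page in reader.pages)
```

A call such as `extract_pdf_text("paper.pdf")` (the file name is a placeholder) would return the document's text as a single string, which could then be fed to the same indexing step used for other file content.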