h2oai / h2ogpt

Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/
http://h2o.ai
Apache License 2.0
11.41k stars 1.25k forks source link

upload docs #1796

Open InesBenAmor99 opened 3 months ago

InesBenAmor99 commented 3 months ago

What could be the problem if, when uploading a document (simple unscanned PDF), the process gets stuck at the processing stage, without displaying any errors, and without the document being uploaded? I've also tested uploading to the online version, but that doesn't work either.

pseudotensor commented 2 months ago

Are you able to provide an example document that has issues? I expect it's getting stuck in using unstructured OCR or something that is very slow.

llmwesee commented 2 months ago

I'm also facing the same issue. I already uploaded around 720 documents in the UserData. But now whenever I trying to upload documents in collections like 'UserData' or 'MyData' the process gets stuck at the processing stage, without displaying any errors, and without the document being uploaded?

pseudotensor commented 2 months ago

Can you see what the processes are by looking at ps -auxwf and top (e.g. hit c and show in wide view)? Any processes still going that seem stuck?

I have over 3000 tests that run over a 24 hour period, and not noticing anything wrong.