Open minixxie opened 7 months ago
Seems no one came across this issue? or I'm the only person running it in 2 pods? Today I've tried again and successfully see data saved in the qdrant database, but when I check the list of docs saved, it's sometimes returnning empty (from the 2nd pod):
curl -X GET --url "http://private-gpt.local/v1/ingest/list" --header "Accept: application/json"
{"object":"list","model":"private-gpt","data":[{"object":"ingest.document","doc_id":"227ea4a8-863f-47d3-9cbf-75aa2bebc447","doc_metadata":{"file_name":"b"}},{"object":"ingest.document","doc_id":"6392894d-1da3-4d77-abd1-7c65e5a33535","doc_metadata":{"file_name":"a"}}]}
curl -X GET --url "http://private-gpt.local/v1/ingest/list" --header "Accept: application/json"
{"object":"list","model":"private-gpt","data":[]}
After some investigation, I found that there is a data file saving the doc on harddrive:
# in POD 1
worker@private-gpt-55cb54b557-2rp2g:/home/worker/app/local_data/private_gpt$ grep -l 6392894d-1da3-4d77-abd1-7c65e5a33535 *
docstore.json
# in POD 2
worker@private-gpt-55cb54b557-vfhdb:/home/worker/app/local_data/private_gpt$ grep -l 6392894d-1da3-4d77-abd1-7c65e5a33535 *
## <empty result>
Anyone knows how to avoid this so it can share the states across 2 pods? Thanks very much.
I'm able to make it run in 2 PODs by changing the document store and index store to use MongoDB: https://github.com/imartinez/privateGPT/compare/main...minixxie:privateGPT:dev
Not sure if this is the correct way of making it stateless.
@minixxie hey i recently added the support to use the pgvector as the vector store https://github.com/imartinez/privateGPT/pull/1624
which might be shared, moreover, have you used a shared PV/PVC or other storage for doc store or index store?
Discussed in https://github.com/imartinez/privateGPT/discussions/1558