Retrying request to /embeddings

win4r / GraphRAG4OpenWebUI

GraphRAG4OpenWebUI integrates Microsoft's GraphRAG technology into Open WebUI, providing a versatile information retrieval API. It combines local, global, and web searches for advanced Q&A systems and search engines. This tool simplifies graph-based retrieval integration in open web environments.

https://www.youtube.com/@AIsuperdomain

Apache License 2.0

293 stars, 68 forks

Retrying request to /embeddings #12

Open rickywu opened 1 month ago

rickywu commented 1 month ago

I didn't configure TAVILY_API_KEY. My environment:

export TAVILY_API_KEY=""

export INPUT_DIR="xxx/input/artifacts"

export GRAPHRAG_API_KEY=""

export GRAPHRAG_API_KEY_EMBEDDING=""

export GRAPHRAG_LLM_MODEL="Qwen1.5-14B-Chat-GPTQ-Int4"

export API_BASE="http://172.17.22.174:9997/v1"

export API_BASE_EMBEDDING="http://172.17.22.174:9997/v1"

export GRAPHRAG_EMBEDDING_MODEL="m3e-base"

The LLM and the embedding model are both served by xinference, but I always get logs like this:

2024-08-02 15:45:51,474 - openai._base_client - INFO - Retrying request to /embeddings in 0.971267 seconds
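The retry message alone does not say why the request failed. A minimal sketch for calling the embeddings endpoint directly and printing the raw outcome, assuming the xinference server exposes the usual OpenAI-compatible `POST /v1/embeddings` route; the helper names `build_embeddings_request` and `probe_embeddings` are hypothetical and not part of GraphRAG4OpenWebUI:

```python
import json
import urllib.error
import urllib.request


def build_embeddings_request(api_base, model, text, api_key=""):
    """Build a POST request for an OpenAI-compatible /embeddings route."""
    url = api_base.rstrip("/") + "/embeddings"
    body = json.dumps({"model": model, "input": text}).encode("utf-8")
    headers = {"Content-Type": "application/json"}
    if api_key:
        # Only send a bearer token when a key is actually configured.
        headers["Authorization"] = "Bearer " + api_key
    return urllib.request.Request(url, data=body, headers=headers, method="POST")


def probe_embeddings(api_base, model, api_key="", timeout=10.0):
    """Send one embedding request and describe what came back."""
    request = build_embeddings_request(api_base, model, "hello", api_key)
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            payload = json.load(response)
        vector = payload["data"][0]["embedding"]
        return "ok: vector of length %d" % len(vector)
    except urllib.error.HTTPError as exc:
        # The server answered, but with an error status; show its message.
        return "HTTP %d: %s" % (exc.code, exc.read().decode("utf-8", "replace"))
    except (urllib.error.URLError, OSError) as exc:
        # Nothing answered at this address and port (or the request timed out).
        return "connection failed: %s" % exc
    except (KeyError, IndexError, ValueError) as exc:
        # The reply was not in the expected OpenAI-style JSON shape.
        return "unexpected response: %r" % exc
```

For example, `print(probe_embeddings("http://172.17.22.174:9997/v1", "m3e-base"))` uses the values from the exports above. A "connection failed" result points at the address or port, while an HTTP error points at the model name or the server's state.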

xldistance commented 1 month ago

Your xinference embedding model isn't running. You need to install xinference, then download and launch the embedding model m3e-base. I prefer bge-m3.

rickywu commented 1 month ago

@xldistance I'm sure it's started. I start the embedding model first, then the LLM.

xldistance commented 1 month ago

You can try export API_BASE_EMBEDDING="http://127.0.0.1:9997/v1"
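Before switching addresses, it may help to check which one actually accepts connections from the machine running GraphRAG4OpenWebUI. A rough sketch using only a TCP connect; the helper name `can_connect` is hypothetical, and a successful connect only shows that something is listening on that port, not that the embedding model is loaded:

```python
import socket
from urllib.parse import urlsplit


def can_connect(api_base, timeout=3.0):
    """Return True if a TCP connection to the host and port in api_base succeeds."""
    parts = urlsplit(api_base)
    # Fall back to the scheme's default port when the URL does not name one.
    port = parts.port or (443 if parts.scheme == "https" else 80)
    try:
        with socket.create_connection((parts.hostname, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, unreachable hosts, and timeouts.
        return False
```

Running `can_connect("http://172.17.22.174:9997/v1")` and `can_connect("http://127.0.0.1:9997/v1")` from the same host shows whether the loopback address suggested above is reachable where the original one is not.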