Closed 1006076811 closed 1 day ago
Obviously, the model deployed by xinference can't be accessed.
The error message refers to the model ID BGE-M3-0, but I have not configured BGE-m3-0; I have only configured bge-m3. I suspect the extra -0 is what causes the access failure. The first few embedding calls work normally, so I do not know at which step the ID becomes bge-m3-0.
I see. My batch_size was set too large, which exhausted the GPU memory. Because Xinference is deployed with Docker, I did not see the error message.
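For anyone hitting the same thing, here is a minimal sketch of capping the number of texts sent per embedding call on the client side. It is not RAGFlow's or Xinference's actual code: `embed_in_batches`, `iter_batches`, and `embed_fn` are hypothetical names, and `embed_fn` stands in for whatever function calls your embedding endpoint. The default of 16 is only an illustrative value; the right number depends on your GPU memory and text lengths.

```python
from typing import Callable, Iterator, List, Sequence


def iter_batches(texts: Sequence[str], batch_size: int) -> Iterator[Sequence[str]]:
    """Yield consecutive slices of `texts`, each with at most `batch_size` items."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    for start in range(0, len(texts), batch_size):
        yield texts[start:start + batch_size]


def embed_in_batches(
    texts: Sequence[str],
    embed_fn: Callable[[List[str]], List[List[float]]],  # hypothetical: wraps your embedding endpoint
    batch_size: int = 16,  # illustrative default, tune to your GPU memory
) -> List[List[float]]:
    """Embed `texts` in small batches so one request never carries too many inputs."""
    vectors: List[List[float]] = []
    for batch in iter_batches(texts, batch_size):
        # One call per batch; results are appended in input order.
        vectors.extend(embed_fn(list(batch)))
    return vectors
```

If the server still fails with a small batch, the container logs (for example via `docker logs` on the Xinference container) are the place to look, since the out-of-memory error is reported there rather than in the client.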
Is there an existing issue for the same bug?
RAGFlow workspace code commit ID
v0.14.1
RAGFlow image version
v0.14.1
Other environment information
No response
Actual behavior
An error occurred while parsing a file using bge-m3 and qwen-7.2b, deployed with Infinity and Xinference.
Expected behavior
No response
Steps to reproduce
Additional information
No response