Open lvyoudashuju opened 2 weeks ago
Re-rank model is very time consuming. I suggest not to use re-rank model since it's low in terms of ROI.
Reordering is required for accurate retrieval.
Re-rank model is very time consuming. I suggest not to use re-rank model since it's low in terms of ROI.
Even if I don't use the re-rank model, it's still slow. Are there any optimization methods? Is it related to the block size of the embedding or the embedding model?
Click on it and you can see the elapsed time of every step of retrieval. Paste that screen shot here if it's convenient for you.
Describe your problem
There is only one node deployed now. If multiple nodes are added, will the retrieval efficiency be improved? Or is there any way you can improve the efficiency of retrieval?