Closed medtrans closed 2 hours ago
对话和查询的时候为什么只用CPU单线程,而不用gpu推理,这是严重问题,根本没法使用
If there're much documents, please expand ES nodes to make a cluster.
确实有这个问题,添加日志也能看到,ES检索慢的很 再加上rank处理,要七八分钟才出结果 知识库上传了大约100万条数据,这到底是哪里的问题呢
3546 result need to be re-rank, it would not be fast.
The time for ES retrieval itself is quite long. May I ask how to control the number of output results for ES retrieval?
Attached is a sample of the parameters for a retrieval request
{ "query": { "bool": { "must": [ { "query_string": { "fields": [ "title_tks^10", "title_sm_tks^5", "important_kwd^30", "important_tks^20", "content_ltks^2", "content_sm_ltks" ], "type": "best_fields", "query": "((案情)^0.25144159157178586 (案例)^0.24685589599849458", "boost": 1, "minimum_should_match": "30%" } } ], "filter": [ { "terms": { "kb_id": [ "6b70bacc8a0311efb5c80242ac180006" ] } }, { "bool": { "must_not": [ { "range": { "available_int": { "lt": 1 } } } ] } } ], "boost": 0.05 } }, "from": 0, "size": 90, "knn": { "field": "q_768_vec", "k": 3, "similarity": 0.1, "num_candidates": 6, "query_vector": [ 0.028860345482826234, ... -0.01591445878148079 ], "filter": { "bool": { "must": [ { "query_string": { "fields": [ "title_tks^10", "title_sm_tks^5", "important_kwd^30", "important_tks^20", "content_ltks^2", "content_sm_ltks" ], "type": "best_fields", "query": "((案情)^0.25144159157178586 (案例)^0.24685589599849458", "boost": 1, "minimum_should_match": "30%" } } ], "filter": [ { "terms": { "kb_id": [ "6b70bacc8a0311efb5c80242ac180006" ] } }, { "bool": { "must_not": [ { "range": { "available_int": { "lt": 1 } } } ] } } ], "boost": 0.05 } } } }
Is there an existing issue for the same bug?
Branch name
main
Commit ID
none
Other environment information
Actual behavior
上传的文件多了之后,对话和查询巨慢,需要好几分钟,无法忍受和正常使用
Expected behavior
上传的文件多了之后,对话和查询巨慢,需要好几分钟,无法忍受和正常使用
Steps to reproduce
Additional information
上传的文件多了之后,对话和查询巨慢,需要好几分钟,无法忍受和正常使用