[Bug]: Chat and query are too slow

medtrans commented 1 month ago

Is there an existing issue for the same bug?

[X] I have checked the existing issues.

Branch name

main

Commit ID

none

Other environment information

win11

Actual behavior

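Once many files have been uploaded, chat and query become extremely slow, taking several minutes. This is unbearable and makes normal use impossible.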

Expected behavior

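Once many files have been uploaded, chat and query become extremely slow, taking several minutes. This is unbearable and makes normal use impossible.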

Steps to reproduce

Chat and query are very slow, taking several minutes, which is unbearable.

Additional information

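Once many files have been uploaded, chat and query become extremely slow, taking several minutes. This is unbearable and makes normal use impossible.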

medtrans commented 1 month ago

Why do chat and query use only a single CPU thread instead of GPU inference? This is a serious problem; the tool is simply unusable.

KevinHuSh commented 1 month ago

If there are many documents, please add ES nodes to form a cluster.

Liam-Ji commented 2 weeks ago

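This problem does exist, and it also shows up after adding logging: ES retrieval is very slow, and with the rank processing on top of that, it takes seven or eight minutes to get a result. About 1 million records have been uploaded to the knowledge base. Where exactly is the problem?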

KevinHuSh commented 2 weeks ago

3546 results need to be re-ranked, so it will not be fast.
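Since re-ranking cost grows with the number of candidates, one common mitigation is to re-rank only the best few hits by retrieval score. The following is a minimal sketch of that idea, not RAGFlow's actual code: `rerank_top_n`, `score_fn`, and the `top_n` default are hypothetical names and values chosen for illustration, and hits are assumed to be dicts carrying an `_score` field as in an ES response.

```python
from typing import Any, Callable, Dict, List

Hit = Dict[str, Any]


def rerank_top_n(
    hits: List[Hit],
    score_fn: Callable[[Hit], float],
    top_n: int = 64,  # hypothetical default; tune to your latency budget
) -> List[Hit]:
    """Re-rank only the top_n hits (by retrieval score) with score_fn.

    The remaining hits keep their retrieval order and are appended
    after the re-ranked head, so the expensive scorer runs at most
    top_n times instead of once per retrieved hit.
    """
    # Order by the retrieval score that ES already computed.
    ordered = sorted(hits, key=lambda h: h["_score"], reverse=True)
    head, tail = ordered[:top_n], ordered[top_n:]
    # sorted() calls the key function exactly once per element of head.
    reranked = sorted(head, key=score_fn, reverse=True)
    return reranked + tail
```

With 3546 hits and `top_n=64`, the expensive scorer would run 64 times; the trade-off is that a relevant chunk ranked below the cutoff by ES is never promoted.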

Liam-Ji commented 2 weeks ago

The time for ES retrieval itself is quite long. May I ask how to control the number of output results for ES retrieval? Attached is a sample of the parameters for a retrieval request:

    {
      "query": {
        "bool": {
          "must": [
            {
              "query_string": {
                "fields": [
                  "title_tks^10",
                  "title_sm_tks^5",
                  "important_kwd^30",
                  "important_tks^20",
                  "content_ltks^2",
                  "content_sm_ltks"
                ],
                "type": "best_fields",
                "query": "((案情)^0.25144159157178586 (案例)^0.24685589599849458",
                "boost": 1,
                "minimum_should_match": "30%"
              }
            }
          ],
          "filter": [
            { "terms": { "kb_id": [ "6b70bacc8a0311efb5c80242ac180006" ] } },
            { "bool": { "must_not": [ { "range": { "available_int": { "lt": 1 } } } ] } }
          ],
          "boost": 0.05
        }
      },
      "from": 0,
      "size": 90,
      "knn": {
        "field": "q_768_vec",
        "k": 3,
        "similarity": 0.1,
        "num_candidates": 6,
        "query_vector": [ 0.028860345482826234, ... -0.01591445878148079 ],
        "filter": {
          "bool": {
            "must": [
              {
                "query_string": {
                  "fields": [
                    "title_tks^10",
                    "title_sm_tks^5",
                    "important_kwd^30",
                    "important_tks^20",
                    "content_ltks^2",
                    "content_sm_ltks"
                  ],
                  "type": "best_fields",
                  "query": "((案情)^0.25144159157178586 (案例)^0.24685589599849458",
                  "boost": 1,
                  "minimum_should_match": "30%"
                }
              }
            ],
            "filter": [
              { "terms": { "kb_id": [ "6b70bacc8a0311efb5c80242ac180006" ] } },
              { "bool": { "must_not": [ { "range": { "available_int": { "lt": 1 } } } ] } }
            ],
            "boost": 0.05
          }
        }
      }
    }

KevinHuSh commented 2 weeks ago

Refer to this about pagination in ES. ES retrieval speed depends on factors such as the number of documents indexed, RAM size, and disk speed.
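For reference, the number of hits ES returns per request is set by the `from` and `size` fields already present in the request body quoted earlier in this thread. Below is a minimal sketch of paging through results by rewriting those two fields. The helper name `with_page` is hypothetical, and the 10,000 limit assumes the default `index.max_result_window` setting. Note that `size` only caps how many hits are returned; it does not by itself reduce how many documents match the query.

```python
import copy
from typing import Any, Dict

# Elasticsearch's default index.max_result_window; from + size may not exceed it.
MAX_RESULT_WINDOW = 10_000


def with_page(body: Dict[str, Any], page: int, page_size: int) -> Dict[str, Any]:
    """Return a copy of an ES search body with from/size set for a 1-based page."""
    if page < 1 or page_size < 1:
        raise ValueError("page and page_size must be >= 1")
    start = (page - 1) * page_size
    if start + page_size > MAX_RESULT_WINDOW:
        raise ValueError("from + size exceeds the default max_result_window")
    paged = copy.deepcopy(body)  # leave the caller's request untouched
    paged["from"] = start
    paged["size"] = page_size
    return paged


# Example: request the third page of 30 hits instead of one page of 90.
request = {"query": {"match_all": {}}, "from": 0, "size": 90}
third_page = with_page(request, page=3, page_size=30)
```

The resulting dict can be sent as the search request body; for paging deeper than the result window, ES offers `search_after` instead of `from`.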

infiniflow / ragflow