opensearch-project / neural-search

Plugin that adds dense neural retrieval into the OpenSearch ecosytem
Apache License 2.0
57 stars 58 forks source link

[META] Improve Hybrid query latency #704

Open martin-gaievski opened 2 months ago

martin-gaievski commented 2 months ago

Hybrid query has high latency comparing to other compound queries like Boolean query. Based on results collected for 2.13 and depending on the dataset and exact query it may be up to 10 times slower than Bool. Another reason for this issue is degradation in performance of hybrid query comparing to initial release e.g. in OpenSearch 2.11.

Following are goals for this work:

There were some GH issues in the past that are related to the same problem, e.g. https://github.com/opensearch-project/neural-search/issues/281. In addition to that, based on analysis of the source code and some profiling I can think of following list of items:

Github issues for each child item: