SynBioDex / SBOLExplorer

MIT License

Problem due to elastic search being limited to 10000 results #93

Open cjmyers opened 3 years ago

cjmyers commented 3 years ago

Elasticsearch only returns the top 10000 scoring results. These results are then intersected with the results of the Virtuoso criteria query. If the intersection contains fewer results than the search needs (given its offset and limit), and Virtuoso returned results that were filtered out because they are not among the Elasticsearch hits, then responses with scores below the top 10000 hits may be missing. To address this, we need to do one of the following:

1) Fetch more Elasticsearch results until enough of them intersect with the Virtuoso results to fill the response.

OR

2) Also return the results that Virtuoso returns but Elasticsearch does not (caveat: these will not be sorted by page rank and may not actually match the search string).

OR

3) Resort to a full Virtuoso search that includes the search string.
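
As a rough illustration of option 1, the loop below keeps requesting further batches of Elasticsearch hits until the intersection with the Virtuoso results is large enough to cover the offset and limit. This is only a sketch under stated assumptions: `fetch_page`, `collect_intersection`, and the URI strings are hypothetical names, not SBOLExplorer code. In a real deployment, paging past the default 10000-result window would presumably need something like Elasticsearch's `search_after` rather than plain `from`/`size`.

```python
from typing import Callable, List, Set


def collect_intersection(
    fetch_page: Callable[[int, int], List[str]],  # hypothetical: (page index, page size) -> URIs in score order
    virtuoso_uris: Set[str],                      # URIs that satisfy the Virtuoso criteria
    offset: int,
    limit: int,
    page_size: int = 10000,
) -> List[str]:
    """Fetch Elasticsearch pages until enough hits also appear in the Virtuoso results."""
    needed = offset + limit
    matched: List[str] = []
    page = 0
    while len(matched) < needed:
        hits = fetch_page(page, page_size)
        if not hits:
            break  # Elasticsearch has no more results to offer
        # Keep Elasticsearch's score order; drop hits that Virtuoso did not return.
        matched.extend(uri for uri in hits if uri in virtuoso_uris)
        page += 1
    return matched[offset:offset + limit]


if __name__ == "__main__":
    # Toy data standing in for Elasticsearch hits, already sorted by score.
    all_hits = [f"u{i}" for i in range(25)]

    def fake_fetch(page: int, size: int) -> List[str]:
        return all_hits[page * size:(page + 1) * size]

    print(collect_intersection(fake_fetch, {"u3", "u12", "u21", "u24"}, offset=1, limit=2, page_size=10))
```

The loop stops as soon as enough intersecting hits are found, so the common case (enough matches in the first batch) costs a single request.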

yu-eric commented 3 years ago

@cjmyers See #95.

cjmyers commented 3 years ago

0) Insufficient number of results to return (< limit, considering the offset)

1) Elasticsearch returns 10000 hits

2) Virtuoso returns results that are not in the 10000 hits from Elasticsearch
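
Read as conditions that must all hold for results to go missing, the three points above could be checked with a small predicate like the one below. This is a sketch, not code from the repository: the function name, parameter names, and the assumption that the conditions are combined with a logical AND are all illustrative.

```python
from typing import List, Set

ES_MAX_HITS = 10000  # the Elasticsearch cap described in this issue


def may_be_missing_results(
    es_hits: List[str],        # URIs returned by Elasticsearch, in score order
    virtuoso_uris: Set[str],   # URIs returned by the Virtuoso criteria query
    offset: int,
    limit: int,
    es_max_hits: int = ES_MAX_HITS,
) -> bool:
    """Return True when all three conditions hold (assumed to be conjunctive)."""
    es_set = set(es_hits)
    intersection = [uri for uri in es_hits if uri in virtuoso_uris]

    insufficient = len(intersection) < offset + limit        # condition 0
    capped = len(es_hits) >= es_max_hits                     # condition 1
    filtered_out = any(uri not in es_set for uri in virtuoso_uris)  # condition 2

    return insufficient and capped and filtered_out
```

When the predicate is true, one of the fallbacks proposed earlier in the thread would apply; when it is false, the intersected results are already complete for the requested page.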