knaw-huc / textannoviz

GNU General Public License v3.0
1 stars 1 forks source link

Error returning result 10000 or higher #123

Open brechtfm opened 2 weeks ago

brechtfm commented 2 weeks ago

Cannot view the 10001st result, the following error message is returned (e.g. at page 1001 if viewing 10 results per page):

Query not understood: IndexQuery(date=null, terms={}, text=jacht, range=null, aggregations=null)
svandaalen commented 2 weeks ago

This appears to be an ES 'issue'

broccoli_1  | WARN  [2024-06-14 16:04:14,197] nl.knaw.huc.broccoli.resources.projects.ProjectsResource: ElasticSearch
failed status="400" query="IndexQuery(date=null, terms={}, text=jacht, range=null, aggregations=null)" result="{"error":
{"root_cause":[{"type":"illegal_argument_exception","reason":"Result window is too large, from + size must be less than or 
equal to: [10000] but was [10010]. See the scroll api for a more efficient way to request large data sets. This limit can be set 
by changing the [index.max_result_window] index level 
setting."}],"type":"search_phase_execution_exception","reason":"all shards 
failed","phase":"query","grouped":true,"failed_shards":[{"shard":0,"index":"docs-2024-03-
18","node":"8OlLhOd6SFiZSVi0R7dKzA","reason":{"type":"illegal_argument_exception","reason":"Result window is too 
large, from + size must be less than or equal to: [10000] but was [10010]. See the scroll api for a more efficient way to 
request large data sets. This limit can be set by changing the [index.max_result_window] index level 
setting."}}],"caused_by":{"type":"illegal_argument_exception","reason":"Result window is too large, from + size must be 
less than or equal to: [10000] but was [10010]. See the scroll api for a more efficient way to request large data sets. This 
limit can be set by changing the [index.max_result_window] index level setting.","caused_by":
{"type":"illegal_argument_exception","reason":"Result window is too large, from + size must be less than or equal to: 
[10000] but was [10010]. See the scroll api for a more efficient way to request large data sets. This limit can be set by 
changing the [index.max_result_window] index level setting."}}},"status":400}"

@hayco and I will look at this next week. Both 'gloccoli' and 'broccoli' are affected.