Closed eugeniashurko closed 1 year ago
Hi @eugeniashurko ,
There is a (configured) 10 000 hit limit when querying ES, which makes it difficult to get all docs.
If this is to be addressed, it has to be done at the index settings level.
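For reference, the 10 000 cap matches the default of Elasticsearch's `index.max_result_window` setting, which can be changed per index with a `PUT /<index>/_settings` request. The sketch below only builds the request body; the helper name and the value `50_000` are illustrative assumptions, and whether a given deployment permits changing the setting is not something this thread confirms.

```python
import json


def max_result_window_settings(window):
    """Build the body for PUT /<index>/_settings (hypothetical helper).

    `index.max_result_window` caps `from + size` for a search request;
    Elasticsearch uses 10000 when the setting is not configured.
    """
    if window < 1:
        raise ValueError("window must be a positive integer")
    return {"index": {"max_result_window": window}}


# Illustrative value only: raising the window makes deep paging more
# expensive for the cluster, so it should be chosen with care.
body = json.dumps(max_result_window_settings(50_000))
print(body)
```

Raising the window increases memory use for deep pages, so it is a trade-off rather than a free fix.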
I see, that's a good point! But I think that on the forge side we should not apply this limit of `100` for no reason (at least when the limit is explicitly `None`). We can just make 'unlimited' queries, and then, depending on the index, no more than 10k resources will be returned. This behaviour was very unexpected for me and introduced some weird, hard-to-debug side effects.
Currently, when querying resources through `forge.elastic`, there is no way to get all the documents. For example, when running:
Only 100 resources are returned. Here is the output with `debug=True`:
To retrieve more than 100 resources, some hacks are needed. For example, hard-coding an unrealistically large limit:
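A minimal sketch of this kind of workaround. It uses a hypothetical stand-in `elastic` function and a fake in-memory index in place of `forge.elastic`, so that it runs without a Nexus deployment; the only behaviour assumed from the real method is a `limit` keyword that defaults to 100.

```python
# Fake index standing in for the real Elasticsearch view (illustration only).
FAKE_INDEX = [{"@id": f"resource-{i}"} for i in range(250)]


def elastic(query, limit=100):
    """Hypothetical stand-in for forge.elastic.

    The query is ignored here; the function just returns at most
    `limit` documents from the fake index.
    """
    return FAKE_INDEX[:limit]


query = '{"query": {"match_all": {}}}'

default_hits = elastic(query)            # silently capped by the default limit
all_hits = elastic(query, limit=10_000)  # workaround: arbitrarily large limit

print(len(default_hits), len(all_hits))
```

The large constant has no meaning of its own, which is what makes the workaround feel like a hack.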
It would be great to have a way to get all the documents by specifying `limit=None`, as with `forge.sparql`.
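One possible way to interpret `limit=None` is sketched below, under stated assumptions: `resolve_size` is a hypothetical helper, not part of forge, and the cap of 10 000 is Elasticsearch's default `index.max_result_window`, which a given index may override. The idea is that `None` maps to the largest size the index will accept instead of a hard-coded 100.

```python
# Elasticsearch's default for index.max_result_window.
ES_DEFAULT_MAX_RESULT_WINDOW = 10_000


def resolve_size(limit, max_result_window=ES_DEFAULT_MAX_RESULT_WINDOW):
    """Translate a user-facing `limit` into a search `size` (hypothetical).

    - limit=None  -> request as many hits as the index allows
    - limit=n     -> request n hits, capped at what the index allows
    """
    if limit is None:
        return max_result_window
    if limit < 0:
        raise ValueError("limit must be non-negative or None")
    return min(limit, max_result_window)


print(resolve_size(None), resolve_size(100), resolve_size(50_000))
```

Retrieving more than the window in a single call would still need paging (for example with `search_after` or the scroll API), which this sketch does not cover.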