Closed Robsteranium closed 3 years ago
This appears to be caused by the select queries used for paging. The results for successive pages (with increasing limit/offset) aren't contiguous (even without any writes in the meantime). Adding an e.g. ORDER BY ?uri
clause seems to resolve this.
The same problem will plague the other pipelines. This solution doesn't work for the observation pager as the order by clause causes it to time out (at least on idp-beta with 28m observations).
For some reason this field has missing values. It ought to be either
"true"
or"false"
.If I run the following query:
Then across the buckets I see a total of 49,441 values despite having 50,337 codes.
Indeed the following query matching the remaining 896 docs where this field is missing.
It's not clear if this is to do with select-pagination or upserts.