biothings / mygeneset.info

Apache License 2.0

Discrepancy between `/metadata` curated count and `/query` count #47

Closed vincerubinetti closed 1 year ago

vincerubinetti commented 2 years ago

Per https://mygeneset.info/v1/metadata, there are 195647 curated genesets and 195650 total genesets. Per https://mygeneset.info/v1/query?q=_exists_:source there are 146961 curated genesets. Per https://mygeneset.info/v1/query there are 146964 total genesets.

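The difference of 3 between curated and total on each endpoint makes sense, because there are only 3 user-created genesets at the moment, all created by ravila4. What doesn't make sense is that the two endpoints disagree with each other by 48,686 genesets, for both the curated and the total counts.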
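For reference, the arithmetic behind the numbers above can be checked with a short sketch. The counts are the ones quoted in this issue, hard-coded rather than fetched live. The helper `fetch_query_total` is hypothetical and is not called here; it assumes the `/query` response is JSON with a top-level `total` field, which should be verified against the live API before relying on it.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Counts quoted in this issue (not fetched live).
METADATA_COUNTS = {"curated": 195647, "total": 195650}
QUERY_COUNTS = {"curated": 146961, "total": 146964}


def user_created(counts):
    """Genesets that are not curated, i.e. total minus curated."""
    return counts["total"] - counts["curated"]


def endpoint_gap(metadata_counts, query_counts):
    """Per-category difference between /metadata and /query counts."""
    return {key: metadata_counts[key] - query_counts[key] for key in metadata_counts}


def fetch_query_total(q=None, base="https://mygeneset.info/v1"):
    """Hypothetical helper: read the hit count from a live /query call.

    Assumes the response JSON has a top-level "total" field.
    """
    url = f"{base}/query"
    if q is not None:
        url += "?" + urlencode({"q": q})
    with urlopen(url, timeout=30) as resp:
        return json.load(resp)["total"]


print(user_created(METADATA_COUNTS), user_created(QUERY_COUNTS))  # 3 3
print(endpoint_gap(METADATA_COUNTS, QUERY_COUNTS))  # {'curated': 48686, 'total': 48686}
```

Both endpoints agree on the 3 user-created genesets, so the mismatch is entirely in the curated set.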

Motivation for this question:

Here I'm using the /metadata info for numbers:

image

But here I'm using /query for numbers (note the very bottom under the table):

image

I'd like them to match.

ravila4 commented 2 years ago

Not sure what is causing this, but I will check whether the discrepancy is still there after the next data update.

ravila4 commented 1 year ago

This issue seems to be fixed in the latest data release. I think it was caused by some documents from the smpdb data source that were missing the `is_public` field, so they didn't show up in queries.
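A minimal sketch of how a missing field could produce this kind of undercount, assuming (this is not confirmed by the thread) that queries only return documents where `is_public` is true, while the metadata count includes every document. The documents and their `_id` values below are made up for illustration.

```python
# Hypothetical documents; real mygeneset.info documents have more fields.
DOCS = [
    {"_id": "hypothetical_1", "source": "smpdb"},                     # is_public missing
    {"_id": "hypothetical_2", "source": "smpdb", "is_public": True},
    {"_id": "hypothetical_3", "source": "other", "is_public": True},
]


def visible_in_query(docs):
    """Assumed query behavior: keep only documents explicitly marked public."""
    return [doc for doc in docs if doc.get("is_public") is True]


def missing_is_public(docs):
    """IDs of documents that lack the is_public field entirely."""
    return [doc["_id"] for doc in docs if "is_public" not in doc]


print(len(DOCS))                    # count that includes every document: 3
print(len(visible_in_query(DOCS)))  # count after the assumed filter: 2
print(missing_is_public(DOCS))      # ['hypothetical_1']
```

Under that assumption, a document without the field is counted in one place but silently filtered out in the other, which matches the symptom reported above.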