gbif / portal-feedback

User feedback for the GBIF API, website and published data. You can ask questions here. 🗨❓

Occurrence count incorrect in dataset search TSV export #4468

Open gbif-portal opened 1 year ago

gbif-portal commented 1 year ago

This issue was originally reported to the helpdesk. The file downloaded by clicking "DOWNLOAD as TSV" on the dataset search page appears to contain incorrect occurrence counts. The counts in the TSV are higher than those shown on the website, in some cases exactly twice as high.

One of our users found 5237 occurrences for the "Type Collection of the National Herbarium of Uzbekistan (TASH)" dataset in the TSV file, but according to the dataset web page it contains only 3955 occurrences. Other datasets show similar errors: "Phenology of Iridaceae" has 1771 occurrences in the TSV file and 1061 on the dataset web page; "Phenology of Crocus" has 888 and 444 respectively; "Phenology of Liliace" has 514 and 297; and "Chronicle of Nature - Phenology of Mammals of Surhanskiy State Nature Reserve" has 216 and 108.
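To make the pattern in these numbers easier to see, here is a small sketch that computes the surplus and the TSV-to-website ratio for each reported dataset. The figures are copied from the report above; the `count_ratios` helper is only illustrative and is not part of any GBIF tooling.

```python
# (dataset title, count in TSV export, count on dataset web page),
# exactly as quoted in the report above.
reported = [
    ("Type Collection of the National Herbarium of Uzbekistan (TASH)", 5237, 3955),
    ("Phenology of Iridaceae", 1771, 1061),
    ("Phenology of Crocus", 888, 444),
    ("Phenology of Liliace", 514, 297),
    ("Chronicle of Nature - Phenology of Mammals of Surhanskiy State Nature Reserve", 216, 108),
]


def count_ratios(rows):
    """Return (title, surplus, ratio) per row, where ratio = tsv / web."""
    return [(title, tsv - web, tsv / web) for title, tsv, web in rows]


for title, surplus, ratio in count_ratios(reported):
    print(f"{title}: +{surplus} occurrences ({ratio:.2f}x)")
```

Two of the five datasets come out at exactly 2.00x and the rest fall between 1x and 2x, which may help narrow down whether all records or only some of them are being counted twice.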


Github user: @ManonGros
System: Safari 16.1.0 / Mac OS X 10.15.7
Referer: https://www.gbif.org/dataset/search?publishing_country=UZ
Window size: width 1349 - height 878
System health at time of feedback: OPERATIONAL

ManonGros commented 1 year ago

@fmendezh

MortenHofft commented 1 year ago

The `recordCount` returned by the API shows the same inflated values, so the problem is not specific to the TSV export: https://api.gbif.org/v1/dataset/search?publishing_country=UZ, if that is of help @fmendezh
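A rough sketch of how the discrepancy could be checked programmatically, comparing each dataset's `recordCount` from the search response with a count from the occurrence index. The assumptions here: the dataset search response has a `results` list whose items carry `key`, `title` and `recordCount`, and `occurrence/search` with `limit=0` returns a `count` field. The helper names (`find_mismatches`, `live_occurrence_count`, `check_live`) are made up for this example.

```python
import json
import urllib.parse
import urllib.request

API = "https://api.gbif.org/v1"


def fetch_json(url):
    """GET a URL and parse the JSON body."""
    with urllib.request.urlopen(url, timeout=30) as resp:
        return json.load(resp)


def find_mismatches(datasets, occurrence_count):
    """Return (title, recordCount, occurrence count) where the two differ.

    datasets: dataset search results (dicts; assumed fields: key, title,
              recordCount).
    occurrence_count: callable mapping a dataset key to its occurrence count.
    """
    mismatches = []
    for dataset in datasets:
        indexed = dataset.get("recordCount")
        if indexed is None:
            continue  # e.g. datasets without occurrence records
        actual = occurrence_count(dataset["key"])
        if indexed != actual:
            mismatches.append((dataset.get("title"), indexed, actual))
    return mismatches


def live_occurrence_count(dataset_key):
    """Occurrence count for one dataset (assumes a `count` field)."""
    query = urllib.parse.urlencode({"datasetKey": dataset_key, "limit": 0})
    return fetch_json(f"{API}/occurrence/search?{query}")["count"]


def check_live(search_url=f"{API}/dataset/search?publishing_country=UZ"):
    """Run the comparison against the live API (performs network requests)."""
    datasets = fetch_json(search_url).get("results", [])
    return find_mismatches(datasets, live_occurrence_count)
```

Calling `check_live()` should list every dataset on the first page of results whose search `recordCount` disagrees with the occurrence index. The comparison itself is kept separate from the HTTP calls so it can be run on saved responses as well.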

CecSve commented 1 year ago

Not sure this is relevant, but it might be, so I'm linking it here: https://github.com/gbif/portal-feedback/issues/3187