gbif / portal-feedback

User feedback for the GBIF API, website and published data. You can ask questions here. 🗨❓

wrong data mapping? #3162

Open gbif-portal opened 3 years ago

gbif-portal commented 3 years ago


The ExtendedMeasurementOrFact seems to be wrongly mapped at the source? https://api.gbif.org/v1/occurrence/2982146336/fragment

There isn't really any information in it:

{
"measurementType": null,
"measurementUnit": null,
"measurementValue": null,
"occurrenceID": "UCALDAS:MHN-UCa:Reptiles:558"
},

Github user: @MortenHofft
System: Chrome 87.0.4280 / Mac OS X 10.15.7
Referer: https://www.gbif.org/occurrence/2982146336
Window size: width 1440 - height 766
System health at time of feedback: WARNING
datasetKey: 243bb217-84a5-43d5-adca-af481ab6ab8f
publishingOrgKey: f7f9717e-9e50-4a00-a30f-7b134390a566

ManonGros commented 3 years ago

The data is mapped correctly, but many rows are empty, with no measurement. I suppose this is an artefact of the way the original file was generated.

MortenHofft commented 3 years ago

But shouldn't it be removed since there is no information? Or perhaps we should remove it during interpretation since it is just noise? We can also just leave it, but it feels like nonsense data. Or have I missed how this brings value?

jhnwllr commented 2 years ago

This seems like a problem that should be handled by interpretation. I don't think that we can enforce that all publishers remove NULL measurements or facts.

camiplata commented 2 years ago

I support the idea of handling the problem during interpretation. Most publishers find it difficult to manage different tables for occurrences and measurements, and even more so if this requires the tables to have different numbers of rows.
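
For illustration, the interpretation-time filtering suggested in this thread could look roughly like the sketch below. It is not GBIF's pipeline code. The function names and the rule itself (drop a row when `measurementType`, `measurementUnit` and `measurementValue` are all null or blank) are assumptions made for this example, and the second row in the sample data is hypothetical, included only for contrast with the empty record quoted in the issue.

```python
from typing import Any, Iterable

# Fields assumed to carry the actual content of a measurement row. They are
# taken from the example record in this issue; a real rule might check more terms.
CONTENT_FIELDS = ("measurementType", "measurementUnit", "measurementValue")


def is_blank(value: Any) -> bool:
    """Return True for None and for empty or whitespace-only strings."""
    return value is None or (isinstance(value, str) and not value.strip())


def drop_empty_measurements(rows: Iterable[dict]) -> list[dict]:
    """Keep only rows where at least one content field has a value."""
    return [
        row for row in rows
        if not all(is_blank(row.get(field)) for field in CONTENT_FIELDS)
    ]


rows = [
    # The empty record quoted in the issue.
    {"measurementType": None, "measurementUnit": None, "measurementValue": None,
     "occurrenceID": "UCALDAS:MHN-UCa:Reptiles:558"},
    # Hypothetical populated row, for contrast only.
    {"measurementType": "snout-vent length", "measurementUnit": "mm",
     "measurementValue": "52", "occurrenceID": "example:1"},
]

kept = drop_empty_measurements(rows)
print(len(kept))
```

Only fully empty rows are dropped here. A row with a numeric value such as `0`, or with just one of the three fields filled in, is kept, so partial information from publishers is not silently discarded.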