gbif-norway / helpdesk

Please submit your helpdesk request here (or send an email to helpdesk@gbif.no). We will also use this repo for documentation of node helpdesk cases.
GNU General Public License v3.0
3 stars 0 forks source link

Three new datasets from Nils Valland #192

Open dagendresen opened 2 hours ago

dagendresen commented 2 hours ago

Three new data files for the dataset of Nils Valland

https://doi.org/10.15468/pzo4mb https://ipt.gbif.no/resource?r=nils

Sifnos-Folegandros_2024.xlsx 20241008_Tavira.xlsx 2023_Guardamar.xlsx

I have uploaded these three data files to the IPT.

Photo URLs

The links to the photos are links to the Google Photos page ABOUT the photo and not direct URLs for the photo files themselves. If the images should be resolved and displayed in the GBIF portal somebody needs to fish out the correct image URLs.

For example

occurrenceID b9472887-490c-4875-9c50-1b244e7b4fcd associatedMedia = https://photos.app.goo.gl/3ZZRFrcUKkFSLev27 --> https://lh3.googleusercontent.com/pw/AP1GczOvbL1Rt4DQLWNNg6gN-PUrTLjaV2fek6836f9YboKlOpPsGYfvGihE9yAH0BGyT3ea8zIH8DW-bsPIxA0k9MUpFDwJNu9nvET7I8lqhB8Rz_tsHTiVUEE0YT0jlgGG0ZyEJzo6mHg6RZDBH5kneo-pNQ=w2942-h1656-s-no-gm?authuser=0

I have suggested to Nils that he update the image URLs if he wants the photos to be visible on the GBIF portal.

Warning on special character ´

Note also the eventDate and decimalLongitude fields included the special character ´ in the original files from Nils to avoid Excel from transforming the data types. I have removed these ´ special characters in the files now on the IPT.

ORCID strings

I have fixed the incomplete ORCID identifiers (one zero missing at the start of the identifier string) 009-0003-8602-3413 --> 0009-0003-8602-3413 and to include the full ORCID, 009-0003-8602-3413 --> https://orcid.org/0009-0003-8602-3413

Suggested improvement

I would have added the prefix urn:uuid: to the UUID identifiers for occurrenceID (however, now published using the naked UUIDs as provided by Nils).

There might be more validation steps helpdesk@gbif.no might want to perform?

dagendresen commented 2 hours ago

Error message: Publishing version #1.34 of resource nils failed: Archive generation for resource nils failed: Can't validate DwC-A for resource nils. Each row in the occurrence file(s) must have a basisOfRecord, and each basisOfRecord must match the Darwin Core Type Vocabulary (please note comparisons are case insensitive)

Changed mappings to basisOfRecord to hard-coded HumanObservation (was reported as Human observation if the data files).