AtlasOfLivingAustralia / data-management

Data management issue tracking
7 stars 0 forks source link

WELT data issues #1072

Open cha801p opened 3 weeks ago

cha801p commented 3 weeks ago

The following issues were raised by the data provider: https://support.ehelp.edu.au/a/tickets/203321

cha801p commented 3 weeks ago

Ticket Update: June 5, 2024 (6:30 PM)

Issue: WELT data issues

Problems:

  1. Collector Names Missing: Certain collector names are being stripped out from the source data. For example, in the dataset entry SP119852, the collectors Carlos and Karin have been removed. However, in dataset entry SP110073, both Leon and Lara are correctly shown. This inconsistency needs to be investigated and resolved to ensure all collector names are properly retained.

  2. RecordedByID Field Issue: When multiple ORCIDs or Wikidata QIDs are supplied, an anomaly occurs in the recordedByID field. The second ID gets concatenated with itself and the other ID SP119852](https://biocache.ala.org.au/occurrences/56d2fc2c-2eeb-4511-bba1-d89a7eb2090f). For instance , instead of showing:

    https://orcid.org/0000-0002-2118-3534 | https://orcid.org/0000-0002-7136-0017

    it appears as:

    https://orcid.org/0000-0002-7136-0017|https://orcid.org/0000-0002-7136-0017|https://orcid.org/0000-0002-2118-3534

Possible Solution: The data has been thoroughly reviewed, found to be fine, and subsequently reingested.

Status:

cha801p commented 3 weeks ago

Issue forwarded to systems.

adam-collins commented 3 weeks ago

These are fixed in the pipelines version currently in the test environment. I think you can find existing github issues for both in biocache-service and/or biocache-hubs.