gbif / portal-feedback

User feedback for the GBIF API, website and published data. You can ask questions here. 🗨❓

issue with multiple URLs of associated sequences in interpreted data #5501

Open mnhn-paul opened 1 month ago

mnhn-paul commented 1 month ago

I noticed an issue with the interpreted value of the Associated sequences field on this record: https://www.gbif.org/occurrence/4605970382

The interpreted data is shown as one single URL, although the original seems to be correctly formatted for multiple URLs, as per the documentation: https://dwc.tdwg.org/terms/#dwc:associatedSequences

original: https://www.ncbi.nlm.nih.gov/nuccore/LC731499 | https://www.ncbi.nlm.nih.gov/nuccore/LC731501
becomes: https://www.ncbi.nlm.nih.gov/nuccore/LC731499|https://www.ncbi.nlm.nih.gov/nuccore/LC731501

MortenHofft commented 1 month ago

I would think both are correct in terms of raw data, but the UI should split on the pipe. The problem is the automatic URL linker, which includes the pipe in the link. It is a bit tricky since | is allowed in URLs as far as I know, but that seems like an edge case. It is probably safe to just split on pipes.
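
To illustrate the idea of splitting before linking, here is a minimal sketch. It is written in Python purely for illustration and is not the portal's actual code. The function name `split_associated_sequences` is hypothetical, and the sketch assumes that a pipe never appears inside an individual URL, which is the edge case mentioned above.

```python
def split_associated_sequences(value: str) -> list[str]:
    """Split a pipe-delimited associatedSequences value into separate entries.

    Hypothetical helper: assumes "|" is only ever used as the list delimiter
    and never occurs inside an individual URL.
    """
    # Split on the pipe, trim whitespace around each piece, drop empty pieces.
    return [part.strip() for part in value.split("|") if part.strip()]


# The two forms quoted in this issue: with and without spaces around the pipe.
original = (
    "https://www.ncbi.nlm.nih.gov/nuccore/LC731499 | "
    "https://www.ncbi.nlm.nih.gov/nuccore/LC731501"
)
interpreted = (
    "https://www.ncbi.nlm.nih.gov/nuccore/LC731499|"
    "https://www.ncbi.nlm.nih.gov/nuccore/LC731501"
)

for url in split_associated_sequences(interpreted):
    print(url)
```

Both forms yield the same two URLs, so each one could then be passed to the linker on its own rather than as a single string.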