okfn / ckanext-tsbsatellites

repo for Satellites Catapult's CKAN
http://data.satapps.org/
GNU Affero General Public License v3.0
3 stars 3 forks source link

Duplicated datasets on remote CSW server #41

Closed amercader closed 10 years ago

amercader commented 10 years ago

Some duplicated datasets keep appearing on the catalogue. These are actual duplicates in the remote CSW server.

CKAN will consider any document with a non-existent fileIndentifier a new dataset, and add some random characters to its name.

Example:

Dataset 1 http://185.30.10.28:8081/dataset/nceo-atmospheric-profiles-retrieved-from-the-michelson-interferometer-for-passive-atmospheric-s

<gmd:fileIdentifier>
<gco:CharacterString xmlns:gco="http://www.isotc211.org/2005/gco">cf23a5d1-96da-434e-a9d0-8c244e564b73</gco:CharacterString>
</gmd:fileIdentifier>

Original document

Dataset 2 http://185.30.10.28:8081/dataset/nceo-atmospheric-profiles-retrieved-from-the-michelson-interferometer-for-passive-atmospheric-sc6401

<gmd:fileIdentifier>
<gco:CharacterString xmlns:gco="http://www.isotc211.org/2005/gco">a30879c6-115f-4cff-bd3a-44bf7f31aa96</gco:CharacterString>
</gmd:fileIdentifier>

Original document

adamamyl commented 10 years ago

mailed clients

adamamyl commented 10 years ago

Should now be fixed upstream, they "got a little over enthusiastic" :)