cessda / cessda.cdc.versions

Issue tracker and wiki for the CESSDA Data Catalogue
https://datacatalogue.cessda.eu/
Apache License 2.0

SND name not in correct format #113

Closed cessda-bitbucket-importer closed 5 years ago

cessda-bitbucket-importer commented 5 years ago

Original report on BitBucket by Taina Jääskeläinen.


In the publisher filter, the Swedish archive is in the form: Swedish national data service

The official name is: Swedish National Data Service

Is this a metadata issue or something at the CDC end? If it is a CDC catalogue issue, please amend.

cessda-bitbucket-importer commented 5 years ago

Original comment by John Shepherdson (GitHub: john-shepherdson).


It is a metadata issue as far as I can tell, as the JSON says "publisher":{"abbr":"SND","publisher":"Swedish national data service"}
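
To confirm what the stored record actually contains, the fragment quoted above can be parsed and compared with the official name. This is only a minimal Python sketch: the `OFFICIAL_NAMES` lookup and the `check_publisher` helper are hypothetical names made up for illustration, not part of the CDC code base.

```python
import json

# Fragment quoted in the comment above, wrapped in braces so it is valid JSON.
record = json.loads(
    '{"publisher":{"abbr":"SND","publisher":"Swedish national data service"}}'
)

# Hypothetical lookup of official names keyed by abbreviation (assumption).
OFFICIAL_NAMES = {"SND": "Swedish National Data Service"}


def check_publisher(rec):
    """Return (exact_match, differs_only_by_case) for the publisher name."""
    pub = rec["publisher"]
    official = OFFICIAL_NAMES[pub["abbr"]]
    name = pub["publisher"]
    exact = name == official
    # casefold() gives a case-insensitive comparison.
    case_only = (not exact) and name.casefold() == official.casefold()
    return exact, case_only


print(check_publisher(record))  # (False, True)
```

A result of `(False, True)` means the stored name differs from the official one only in capitalisation, which matches what the publisher filter shows.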

However, there are 1,480 records with this publisher, which seems unlikely.

@doraVentures please check that we are receiving the publisher in this format (Swedish national data service) so we can rule out a data transformation error at our end.

cessda-bitbucket-importer commented 5 years ago

Original comment by Ashley Fox.


FYI the UI is not doing any transformation to the publisher field. It is reading it directly from the JSON…

{
  "_source": {
    "publisher": {
      "abbr": "Abbreviation",
      "publisher": "This property is used by the UI"
    }
  }
}

cessda-bitbucket-importer commented 5 years ago

Original comment by Moses Mansaray (GitHub: doraVentures).


Okay, I’ll look into this. Cheers.

cessda-bitbucket-importer commented 5 years ago

Original comment by Moses Mansaray (GitHub: doraVentures).


Short answer: the root cause is an issue in the SP's metadata.

Re: please check that we are receiving the publisher in this format (Swedish national data service) so we can rule out a data transformation error at our end.

Yes, we are. See examples below.


Re: …there are 1,480 records with this publisher, which seems unlikely.

Listing the identifiers for SND today (12/05/2019) shows 1,845 available records in total. From this we can deduce that 1,480 have been harvested successfully, that being the number of XML records that pass the CMM minimum required fields for a record to be ingested into Elasticsearch.
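
For anyone wanting to reproduce the "available records" count, here is a rough Python sketch that counts the headers in an OAI-PMH `ListIdentifiers` response. It is not the harvester's actual code. The sample XML and its identifiers are invented for illustration, and a real endpoint may page its results with resumption tokens, which this sketch does not handle.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the OAI-PMH 2.0 specification.
OAI_NS = {"oai": "http://www.openarchives.org/OAI/2.0/"}

# Invented sample response (assumption); real identifiers will differ.
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListIdentifiers>
    <header><identifier>example-1</identifier><datestamp>2019-05-01</datestamp></header>
    <header status="deleted"><identifier>example-2</identifier><datestamp>2019-05-02</datestamp></header>
    <header><identifier>example-3</identifier><datestamp>2019-05-03</datestamp></header>
  </ListIdentifiers>
</OAI-PMH>"""


def count_identifiers(xml_text):
    """Return (total_headers, non_deleted_headers) for one response page."""
    root = ET.fromstring(xml_text)
    headers = root.findall("./oai:ListIdentifiers/oai:header", OAI_NS)
    deleted = sum(1 for h in headers if h.get("status") == "deleted")
    return len(headers), len(headers) - deleted


print(count_identifiers(SAMPLE))  # (3, 2)
```

The gap between the listed total and the ingested count (1,845 versus 1,480 here) would then correspond to records that did not pass the minimum required fields.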

cessda-bitbucket-importer commented 5 years ago

Original comment by Moses Mansaray (GitHub: doraVentures).


Over to you @john-shepherdson

cessda-bitbucket-importer commented 5 years ago

Original comment by Taina Jääskeläinen.


Seems to be a metadata issue then. Have informed SND. The issue can probably be resolved.

cessda-bitbucket-importer commented 5 years ago

Original comment by John Shepherdson (GitHub: john-shepherdson).


Added to metadata office issue tracker.

See https://github.com/cessda/cessda.metadata.office/issues/8

cessda-bitbucket-importer commented 4 years ago

Original comment by Taina Jääskeläinen.


It was an OAI-PMH issue, which has now been amended at SND, so the issue can remain closed.