GSA / data.gov

Main repository for the data.gov service
https://data.gov
Other
594 stars 92 forks source link

Incorrect metadata standard guess #4349

Open chris-macdermaid opened 1 year ago

chris-macdermaid commented 1 year ago

How to reproduce

  1. Description of steps to reproduce the issue.

ckanext-spatial's guess_standard isn't catching records like this National Address Database record because it doesn't explicitly include the namespace. It uses MD_Metadata vs gmd:MD_Metadata. The namespace doesn't need to be included for it to be valid. It defaults to gmd by definition.

The guess_standard returns unknown for the record. The harvester shows that it tries to run the CSDGM->ISO transform which fails.

This affects the harvesting of the DOT National Address Database record.

Expected behavior

guess_standard should recognize the default namespace for a record and treat the record as an ISO record.

Actual behavior

The guess_standard returns unknown for the record. The harvester shows that it tries to run the CSDGM->ISO transform which fails.

Records using a default namespace fail to be harvested into the Data.gov Catalog.

chris-macdermaid commented 1 year ago

Attaching the XML file in case the linked file goes away.

NationalAddressDatabaseMetadata-xml.txt

The record is harvested via this harvest https://catalog.data.gov/harvest/usdot-geospatial-metadata