nanoos-pnw / NCEI-archiving

Code, documentation and issue tracking for NANOOS NCEI archiving
Apache License 2.0
1 stars 1 forks source link

CSDGM Validation from oXygen results for DOGAMI metadata record #13

Open MathewBiddle opened 3 years ago

MathewBiddle commented 3 years ago

NCEI has ran the example metadata file DOGAMI_Rockaway_Metdata_v2.xml through the CSDGM Validation from oXygen and below are the results. Note, this is just covering the basic validation, this doesn't go into the details that will be necessary when transformed into ISO.

  1. Invalid content was found starting with element rngdates. No child element is expected at this point.
    1. Doesn’t like having single date AND range of dates in Time Period of Information.
  2. Ongoing is not a valid value of union type enddateType.
  3. The value Ongoing of element enddate is not valid.
    1. Domain is "Unknown" "Present" free date, not Ongoing.
  4. Invalid content was found starting with element keywords. One of {spdom} is expected.
    1. Looking for Spatial Domain - the geographic areal domain of the data set. Doesn’t include bounding coordinates.
  5. Invalid content was found starting with element themekey. One of {themekt} is expected.
    1. No Theme Keyword Thesaurus -- reference to a formally registered thesaurus or a similar authoritative source of theme keywords.
    2. Can enter “None” if no thesaurus is used
  6. Invalid content was found starting with element placekey. One of {placekt} is expected.
    1. No Place Keyword Thesaurus -- reference to a formally registered thesaurus or a similar authoritative source of place keywords
    2. Can enter “None” if no thesaurus is used
  7. Invalid content was found starting with element cntinfo. No child element is expected at this point.
  8. The content of element cntinfo is not complete. One of {cntaddr} is expected.
    1. Looking for Contact Address -- the address for the organization or individual. Note that <cntper> includes contact address.
  9. Invalid content was found starting with element browset. One of {browsen} is expected.
    1. Looking for Browse Graphic File Name -- name of a related graphic file that provides an illustration of the data set
  10. Invalid content was found starting with element secclass. One of {secsys} is expected.
    1. Looking for Security Classification System -- name of the classification system.
  11. ±1 cm + 1 ppm (parts per million × the baseline length) is not a valid value for double.
    1. text entered in “”real” field.
  12. The value ±1 cm + 1 ppm (parts per million × the baseline length) of element horizpav is not valid.
    1. Element Horizontal Positional Accuracy Value -- an estimate of the accuracy of the horizontal coordinate measurements in the data set expressed in (ground) meters. has Domain: free real
  13. ±2 cm + 1 ppm (parts per million × the baseline length) is not a valid value for double.
  14. The value ±2 cm + 1 ppm (parts per million × the baseline length) of element vertaccv is not valid.
    1. Vertical Positional Accuracy Value -- an estimate of the accuracy of the vertical coordinate measurements in the data set expressed in (ground) meters has Domain: free real
  15. Invalid content was found starting with element srctime. One of {srcscale, typesrc} is expected.
    1. Looking for Source Scale Denominator -- the denominator of the representative fraction on a map (for example, on a1:24,000-scale map, the Source Scale Denominator is 24000).
    2. Looking for Type of Source Media -- the medium of the source data set.
  16. Ongoing is not a valid value of union type enddateType.
  17. The value Ongoing of element enddate is not valid.
    1. Ending Date Domain: "Unknown" "Present" free date
  18. Element lineage has no value for the key srcciteaKey.
    1. Source Citation should include Source Citation Abbreviation -- short-form alias for the source citation.
  19. Invalid content was found starting with element proccont. One of {srcused, procdate} is expected.
    1. Looking for Source Used Citation Abbreviation -- the Source Citation Abbreviation of a data set used in the processing step.
    2. Looking for Process Date -- the date when the event was completed; Domain: "Unknown" "Not complete" free date
  20. The content of element planar is not complete. One of {planci} is expected.
    1. Looking for Planar Coordinate Information -- information about the coordinate system developed on the planar surface.
  21. Invalid content was found starting with element attr. One of {enttyp} is expected.
    1. Entity Type -- the definition and description of a set into which similar entity instances are classified
  22. The content of element attr is not complete. One of {attrdefs} is expected. (x5)
    1. Attribute Definition Source -- the authority of the definition.
  23. The content of element digform is not complete. One of {digtopt} is expected.
    1. Digital Transfer Option -- the means and media by which a data set is obtained from the distributor
  24. The content of element stdorder is not complete. One of {digform, fees} is expected.
    1. Digital Form -- the description of options for obtaining the data set on computer-compatible media.
    2. Fees -- the fees and terms for retrieving the data set.
  25. Invalid content was found starting with element rngdates. No child element is expected at this point.
    1. Range of Dates has invalid ending date. See below
  26. Ongoing is not a valid value of union type enddateType.
  27. The value Ongoing of element enddate is not valid.
    1. Domain: "Unknown" "Present" free date
  28. Invalid content was found starting with element metc. One of {metd} is expected.
    1. Domain: free date
emiliom commented 3 years ago

Thanks! This will be helpful when we get back to this task, hopefully next week.

Maybe I should renew my lapsed oXygen subscription ...

crisien commented 3 years ago

FYI I did forward these results to Jon.