wmo-im / wcmp

WMO Core Metadata Profile
https://github.com/wmo-im/wcmp
6 stars 6 forks source link

New KPI regarding filename in metadata #81

Open jsieland opened 3 years ago

jsieland commented 3 years ago

Summary and Purpose Comment from my GISC Offenbach colleagues.

Proposal

Reason

tomkralidis commented 3 years ago

@jsieland thanks for reporting. Can we get clarification? What filename are we referring to (in which element of WCMP)?

Keep in mind URLs may point to files with filenames or an API call (which has no filename per se).

josusky commented 3 years ago

According to publication 1060, requirement 8.1.1: "WIS discovery metadata record shall be uniquely identified using the gmd:​MD​_Metadata/​gmd:​fileIdentifier attribute". In case of metadata for traditional bulletins this can be easily translated to the heading (AHL). But for other types of data, I suppose that is what @jsieland means, we need some other identifier. There is the so called "online resource", here is an example:

 <gmd:transferOptions>
        <gmd:MD_DigitalTransferOptions>
<gmd:onLine>
            <gmd:CI_OnlineResource>
              <gmd:linkage>
                <gmd:URL>http://gisc.dwd.de/wisportal/#SearchPlace:q?pid=urn:x-wmo:md:int.wmo.wis::ISMD01EDZW</gmd:URL>
              </gmd:linkage>
              <gmd:protocol>
                <gco:CharacterString>http</gco:CharacterString>
              </gmd:protocol>
              <gmd:name>
                <gco:CharacterString>GISC Offenbach, Deutscher Wetterdienst</gco:CharacterString>
              </gmd:name>
              <gmd:description>
                <gco:CharacterString>WMO Information System, download products/data through GISC Offenbach, Deutscher Wetterdienst</gco:CharacterString>
              </gmd:description>
            </gmd:CI_OnlineResource>
          </gmd:onLine>

Is this the URL you ar referring to? The "CI_OnlineResource" has attributes and contains (may contain) additional elements, I guess, one of those could be used to give (optional) information about file name. Of course, it should be some pattern, not a fixed name, as the actual file name may contain variable parts (e.g. date/time). I see that "gmd:name", that looks like a good place to me, is currently used for things like that are more a description than a name, could we (re)define the meaning of this element? Or perhaps use something else - @jsieland any proposals?

jsieland commented 3 years ago

The idea is to be able to access the data automatically (as far as possible) based on the metadata. This means that the metadata contains a reference that can also be used by automated processes to access the respective data:

I like the idea of using the additional elements of gmd:CI_OnlineResource - as far as I'm aware INSPIRE recommends using gmd:name, gmd:description and gmd:function/gmd:CI_OnLineFunctionCode.