open-data / ckanext-canada

Government of Canada CKAN Extension - Extension à CKAN du Gouvernement du Canada

HTTP download links missing on some datasets on registry #108

Closed: pjakobsen closed 11 years ago

pjakobsen commented 11 years ago

The HTTP download link is not available on GeoGratis, but it is available on certain records in the registry:

http://registry.statcan.gc.ca/en/dataset/af4e99f3-47b0-59cb-b846-60ae95dd2ca7

pjakobsen commented 11 years ago

It's hard to say why this is the case, because both links, PDF and GEOSCAN, were obtained from the resource listings in the NAP metadata file. Two possible reasons: the GEOSCAN link has been filtered out at GeoGratis, or there is a bug in the GeoGratis site. Will investigate further.

pjakobsen commented 11 years ago

Just noticed that while both resources are present in the .nap file, only the PDF is listed in the .atom file.

http://geogratis.gc.ca/api/en/nrcan-rncan/ess-sst/af4e99f3-47b0-59cb-b846-60ae95dd2ca7.atom vs. http://geogratis.gc.ca/api/en/nrcan-rncan/ess-sst/af4e99f3-47b0-59cb-b846-60ae95dd2ca7.nap

The NAP file includes these elements:

<gmd:onlineResource>
  <gmd:CI_OnlineResource>
    <gmd:linkage>
      <gmd:URL>http://geoscan.ess.nrcan.gc.ca/cgi-bin/starfinder/0?path=geoscan.fl&amp;id=fastlink&amp;pass=&amp;format=FLFULL&amp;search=R=289859</gmd:URL>
    </gmd:linkage>
    <gmd:protocol>
      <gco:CharacterString xmlns:gco="http://www.isotc211.org/2005/gco">HTTP</gco:CharacterString>
    </gmd:protocol>
    <gmd:name>
      <gco:CharacterString xmlns:gco="http://www.isotc211.org/2005/gco">GEOSCAN</gco:CharacterString>
    </gmd:name>
    <gmd:description>
      <gco:CharacterString xmlns:gco="http://www.isotc211.org/2005/gco">GEOSCAN is the bibliographic database for scientific publications of the Earth Sciences Sector (ESS) of Natural Resources Canada.</gco:CharacterString>
    </gmd:description>
    <gmd:function>
      <gmd:CI_OnLineFunctionCode codeList="http://nap.geogratis.gc.ca/metadata/register/napMetadataRegister.xml#IC_88"
                                 codeListValue="RI_376">information; information</gmd:CI_OnLineFunctionCode>
    </gmd:function>
  </gmd:CI_OnlineResource>
</gmd:onlineResource>

And here is the only online resource in the Atom file, which is the FTP link to the PDF; the GEOSCAN HTTP entry is absent:

<gmd:CI_OnlineResource>
  <gmd:linkage>
    <gmd:URL>ftp://ftp2.cits.rncan.gc.ca/pub/geott/ess_pubs/289/289859/of_6882.pdf</gmd:URL>
  </gmd:linkage>
  <gmd:protocol>
    <gco:CharacterString xmlns:gco="http://www.isotc211.org/2005/gco">FTP</gco:CharacterString>
  </gmd:protocol>
  <gmd:name>
    <gco:CharacterString xmlns:gco="http://www.isotc211.org/2005/gco">PDF</gco:CharacterString>
  </gmd:name>
  <gmd:description>
    <gco:CharacterString xmlns:gco="http://www.isotc211.org/2005/gco">Download PDF file through FTP</gco:CharacterString>
  </gmd:description>
</gmd:CI_OnlineResource>
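
To check other records for the same discrepancy, the comparison above could be automated. The following is a minimal sketch, not code from ckanext-canada: the function names `online_resources` and `missing_resources`, the `_sample` helper, and the example.org URLs are all hypothetical, and it assumes both files expose their links as `gmd:CI_OnlineResource` elements in the standard ISO 19139 `gmd` and `gco` namespaces, as in the excerpts above.

```python
import xml.etree.ElementTree as ET

# Standard ISO 19139 namespace URIs used by the gmd:/gco: prefixes.
NS = {
    "gmd": "http://www.isotc211.org/2005/gmd",
    "gco": "http://www.isotc211.org/2005/gco",
}


def online_resources(xml_text):
    """Return {url: (protocol, name)} for every gmd:CI_OnlineResource found."""
    root = ET.fromstring(xml_text)
    found = {}
    # iter() walks the whole tree, so the nesting depth does not matter.
    for res in root.iter("{%s}CI_OnlineResource" % NS["gmd"]):
        url = res.findtext("gmd:linkage/gmd:URL", default="", namespaces=NS)
        protocol = res.findtext(
            "gmd:protocol/gco:CharacterString", default="", namespaces=NS)
        name = res.findtext(
            "gmd:name/gco:CharacterString", default="", namespaces=NS)
        found[url.strip()] = (protocol.strip(), name.strip())
    return found


def missing_resources(nap_xml, atom_xml):
    """Resources listed in the NAP document but absent from the Atom one."""
    nap = online_resources(nap_xml)
    atom = online_resources(atom_xml)
    return {url: meta for url, meta in nap.items() if url not in atom}


def _sample(resources):
    """Build a tiny stand-in document from (url, protocol, name) tuples."""
    items = "".join(
        "<gmd:onlineResource><gmd:CI_OnlineResource>"
        "<gmd:linkage><gmd:URL>%s</gmd:URL></gmd:linkage>"
        "<gmd:protocol><gco:CharacterString>%s</gco:CharacterString></gmd:protocol>"
        "<gmd:name><gco:CharacterString>%s</gco:CharacterString></gmd:name>"
        "</gmd:CI_OnlineResource></gmd:onlineResource>" % r
        for r in resources
    )
    return '<root xmlns:gmd="%s" xmlns:gco="%s">%s</root>' % (
        NS["gmd"], NS["gco"], items)


# Hypothetical stand-ins for the two links discussed in this issue.
PDF = ("ftp://example.org/of_0000.pdf", "FTP", "PDF")
GEOSCAN = ("http://example.org/geoscan?id=1", "HTTP", "GEOSCAN")

nap_doc = _sample([PDF, GEOSCAN])   # NAP lists both resources
atom_doc = _sample([PDF])           # Atom lists only the PDF

for url, (protocol, name) in missing_resources(nap_doc, atom_doc).items():
    print(name, protocol, url)
```

In practice the two arguments would be the downloaded contents of a record's .nap and .atom URLs. Keying on the URL rather than the name is a design choice made here so that two resources sharing a label are still told apart.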