Our CSW connector currently looks for distributions at JSON path: $.distributionInfo[*].MD_Distribution[*].transferOptions[*].MD_DigitalTransferOptions[*].onLine[*].CI_OnlineResource[*]
However, some datasets my have distribution info published at: $.distributionInfo[*].MD_Distribution[*].distributionFormat[*].MD_Format[*].formatDistributor[*].D_Distributor[*].distributorTransferOptions[*].MD_DigitalTransferOptions[*].onLine[*].CI_OnlineResource[*]
Proposed Fix
Have CSW connector search for all MD_DigitalTransferOptions nodes rather than under any particular JSON path.
Problem description
Knowledge network uses CSW connector to crawl TERN data source. Endpoint: https://geonetwork.tern.org.au/geonetwork/srv/eng/csw
The crawl result shows some datasets has 0 distribution.
Example is:
https://staging-test.knowledgenet.co/dataset/ds-tern-45e488c7-38ad-40a8-97e7-ad16a9c1c8f9
The dataset detail is:
The crawling URL for that dataset is:
https://geonetwork.tern.org.au/geonetwork/srv/eng/csw?service=CSW&version=2.0.2&request=GetRecordById&elementsetname=full&outputschema=http%3A%2F%2Fwww.isotc211.org%2F2005%2Fgmd&typeNames=gmd%3AMD_Metadata&id=45e488c7-38ad-40a8-97e7-ad16a9c1c8f9
Our CSW connector currently looks for distributions at JSON path:
$.distributionInfo[*].MD_Distribution[*].transferOptions[*].MD_DigitalTransferOptions[*].onLine[*].CI_OnlineResource[*]
However, some datasets my have distribution info published at:
$.distributionInfo[*].MD_Distribution[*].distributionFormat[*].MD_Format[*].formatDistributor[*].D_Distributor[*].distributorTransferOptions[*].MD_DigitalTransferOptions[*].onLine[*].CI_OnlineResource[*]
Proposed Fix
Have CSW connector search for all
MD_DigitalTransferOptions
nodes rather than under any particular JSON path.