Open bertvannuffelen opened 5 years ago
Make a first analysis.
https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fmetadata.geopunt.be%2Fzoekdienst%2Fsrv%2Fdut%2Fcsw%3FOUTPUTFORMAT%3Dapplication%252Fxml%26SERVICE%3DCSW%26OUTPUTSCHEMA%3Dhttp%253A%252F%252Fwww.isotc211.org%252F2005%252Fgmd%26REQUEST%3DGetRecords%26VERSION%3D2.0.2%26constraintLanguage%3DCQL_TEXT%26RESULTTYPE%3Dhits%26constraint%3Dsubject%2520like%2520%2527Vlaamse%3Fopen%3Fdata%2527%26constraint_language_version%3D1.1.0&data=02%7C01%7Cbert.vannuffelen%40kb.vlaanderen.be%7C495080983971474b28ae08d6deb9f7e5%7C0c0338a695614ee8b8d64e89cbd520a0%7C0%7C0%7C636941287647084307&sdata=%2F8tZOpZzl5I1xEk9ttD33WzWiVJyqQRPcBsIv2UOwdw%3D&reserved=0
The problem is as follows:
According to https://tools.ietf.org/html/rfc1738, the \ is an unsafe character and should be escaped.
The following call to GeoNetwork shows that this is not the case:
wget -o csw.3 -O result.csw.3 "https://metadata.geopunt.be/zoekdienst/srv/dut/csw?OUTPUTFORMAT=application%2Fxml&SERVICE=CSW&OUTPUTSCHEMA=http%3A%2F%2Fwww.isotc211.org%2F2005%2Fgmd&REQUEST=GetRecords&VERSION=2.0.2&constraintLanguage=CQL_TEXT&RESULTTYPE=results&constraint=subject%20like%20%27Vlaamse%3Fopen%3Fdata%27%20and%20subject%20like%20%27%25hydrografische%25%27&constraint_language_version=1.1.0"
As a result, the RDF generated from these URLs is incorrect, because the above URL will be used as a URI and not as a literal value.
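To illustrate the kind of escaping meant here, the following is a minimal sketch in Python of percent-encoding unsafe characters in a download URL before it is emitted as a URI in RDF. It is not the harvester's actual code: the function name `encode_unsafe`, the choice of characters left unescaped, and the `example.org` URL are all assumptions made for illustration.

```python
from urllib.parse import quote

# Assumption: characters left untouched are the URL delimiters plus "%",
# so that escapes already present in the URL are not encoded a second time.
KEEP = ";/?:@=&%$+!*'(),#"


def encode_unsafe(url: str) -> str:
    """Percent-encode every character outside KEEP (hypothetical helper).

    Letters, digits and "_.-~" are always left as-is by urllib.parse.quote.
    """
    return quote(url, safe=KEEP)


# Hypothetical download URL containing a backslash and a space.
raw = "https://example.org/download?file=a\\b c.zip"
print(encode_unsafe(raw))
```

With this sketch the backslash becomes `%5C` and the space becomes `%20`, while a URL that is already encoded passes through unchanged. Whether `%` should be kept as-is depends on whether the harvested URLs can contain a literal percent sign; that is left open here.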
Proposed solution
The GeoNetwork harvester does not correctly URL-encode the download URLs: