DM2E / dm2e-mappings

0 stars 0 forks source link

URL-encoded slash in WebResource URL #77

Closed kba closed 8 years ago

kba commented 10 years ago

E.g. http://data.dm2e.eu/data/concept/bbaw/dta/HERR%2F in spener_piadesideria_1676.TEI-P5.xml

Happens in

and many more.

ksdm2e commented 10 years ago

Thanks for recognizing this.

This is no problem: the concepts in bbaw/dta are candidates for processing with SILK and not reliable concepts. They can contain what-ever, random-like strings.

However, I can replace all reserved characters (for URL) in the next version of the script.

ksdm2e commented 10 years ago

fyi:

   Example 2
   The URIs
    http://www.w3.org/albert/bertram/marie-claude
   and
   http://www.w3.org/albert/bertram%2Fmarie-claude
   are NOT identical, as in the second case the encoded slash does not have hierarchical significance.. 

From http://www.w3.org/Addressing/URL/4_URI_Recommentations.html

I understand it in that way, that an encoded slash is ok. Just a comment:

kba commented 10 years ago

Yes, it is not a forbidden character per se, it's just that it breaks Pubby and is confusing.

ksdm2e commented 10 years ago

oh, pubby? really? my friend ...

I changed it in the recent version already: concepts with reserved characters should be omitted.

kba commented 10 years ago

Can't you just replace the reserved characters with something like underscore? You don't have that problem for any external URIs, just for those that you create yourself in the 'http://data.dm2e.eu/data/' namespace.

ksdm2e commented 10 years ago

It's ok. These concepts are only suggestions.

Thanks! afk for now!

kba commented 10 years ago

Still happens erbkam_tagebuch02_1843.TEI-P5.xml