alastor1729 / sitemapgen4j

Automatically exported from code.google.com/p/sitemapgen4j

AbstractSitemapUrlRenderer should take care of escaping entities & and < in URLs #2

Open GoogleCodeExporter opened 8 years ago

GoogleCodeExporter commented 8 years ago
What steps will reproduce the problem?
1. Add a URL containing an & in the path, e.g. http://www.domain.com/user/me&you/
2. Generate the sitemap.
3. The ampersand is written to the sitemap as-is, not escaped for XML.

What is the expected output? What do you see instead?
The ampersand should be escaped for XML:
http://www.domain.com/user/me&amp;you/

What version of the product are you using? On what operating system?
1.0.1 on Windows XP

Please provide any additional information below.
& is a valid character in a URL, but it must not appear unescaped in XML; the same goes for < whenever it shows up in a URL.
See the URI RFC (sections 3.3 / 3.4): http://www.ietf.org/rfc/rfc2396.txt
and the XML spec (section 2.4): http://www.w3.org/TR/REC-xml/
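
For illustration, here is a minimal sketch of the kind of escaping the renderer would need to apply before writing a URL into the sitemap. The class name XmlUrlEscaper and its escape method are hypothetical and not part of sitemapgen4j; the sketch also assumes the input has not been escaped already, since running it twice would turn &amp; into &amp;amp;.

```java
// Hypothetical helper, not part of sitemapgen4j: escapes the characters
// that XML does not allow unescaped in character data or attribute values.
final class XmlUrlEscaper {
    private XmlUrlEscaper() {}

    // Assumes the URL has NOT been XML-escaped yet; calling this on an
    // already escaped string would double-escape the ampersands.
    static String escape(String url) {
        StringBuilder sb = new StringBuilder(url.length() + 16);
        for (int i = 0; i < url.length(); i++) {
            char c = url.charAt(i);
            switch (c) {
                case '&':  sb.append("&amp;");  break;
                case '<':  sb.append("&lt;");   break;
                case '>':  sb.append("&gt;");   break;
                case '"':  sb.append("&quot;"); break;
                case '\'': sb.append("&apos;"); break;
                default:   sb.append(c);
            }
        }
        return sb.toString();
    }
}
```

With this sketch, XmlUrlEscaper.escape("http://www.domain.com/user/me&you/") gives http://www.domain.com/user/me&amp;you/, which is the form expected inside the sitemap's loc element.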

Original issue reported on code.google.com by asdfasdf...@mailinator.com on 9 May 2009 at 4:47

GoogleCodeExporter commented 8 years ago
Any update on this problem? Do I have to encode the URLs myself?

Original comment by pla...@gmail.com on 20 Aug 2010 at 7:54