A flexible Java text processor. BB, BBCode, BB-code, HTML, Textile, Markdown, parser, translator, converter.
GNU Lesser General Public License v3.0
86
stars
14
forks
source link
Allow more characters in the path of a url. #46
Closed
jasperkrijgsman closed 8 years ago
Urls with paths containing non-alphanumeric characters like '&' which are allowed by RFC3986 are not parsed correcly.
A path segment (the parts in a path separated by /) in an absolute URI path can contain zero or more of pchar that is defined as follows:
pchar = unreserved / pct-encoded / sub-delims / ":" / "@" pct-encoded = "%" HEXDIG HEXDIG unreserved = ALPHA / DIGIT / "-" / "." / "_" / "~" sub-delims = "!" / "$" / "&" / "'" / "(" / ")" / "*" / "+" / "," / ";" / "="