Open clouduol opened 3 years ago
At least according to RFC 2616, Chrome's behavior seems correct. A header name is a sequence of token
, and token
is any character excluding CTL
s or SEPARATOR
s, neither of which includes "
. However RFC 7230 (which obsoletes 2616) defines a token
as a sequence of tchar
, a class which specifically excludes double quotes.
I'd think a parser should be able to handle the old definition because it's impossible to tell whether a given message conforms to RFC 2616 or 7230 (both describe HTTP/1.1), but practically the new definition seems to have been changed because parsing the old one is much more complex than it initially seems and many implementations differ in their interpretation.
As a related example, RFC 7230 (section 3.2.4) deprecates line folding for header values and allows servers receiving folded lines to reject them, but also requires user agents to accept folded lines by converting to sequences of spaces before interpreting. In that particular instance, it seems like this library should implement the behavior specified for user agents because that is also permitted for servers.
It seems like the rationale in #68 applies here as well though: supporting the old behavior is difficult and probably slow, so it's simply not supported.
https://github.com/seanmonstar/httparse/pull/114 will let you ignore the invalid header, which is what Chrome is doing.
In the header name map, index 34 is false. So when parsing headers, the program will return Error::HeaderName when meeting double quotes(ascii number is 34) . However, bouble quotes in headers can be parsed correctly in chrome. Is this intentional or a bug?
Test Example: