open-policy-agent / opa

Open Policy Agent (OPA) is an open source, general-purpose policy engine.
https://www.openpolicyagent.org
Apache License 2.0

server/authorizer: Fix gzip payload handling. #6825

Closed · philipaconrad closed this 3 months ago

philipaconrad commented 3 months ago

What changed in this PR?

This PR fixes an issue where an OPA instance running authorization policies was unable to handle gzipped request bodies.

Example OPA CLI setup:

opa run -s --authorization=basic

Example request:

echo -n '{}' | gzip | curl -H "Content-Encoding: gzip" --data-binary @- http://127.0.0.1:8181/v1/data

This would result in unhelpful error messages, like:

{
  "code": "invalid_parameter",
  "message": "invalid character '\\x1f' looking for beginning of value"
}

The cause was that the request body handling in the server/authorizer package did not take gzipped payloads into account. The fix borrows the gzip request-body handling function from server/server.go so that the body is transparently decompressed when needed.
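
For illustration, the transparent-decompression idea looks roughly like the following; this is a minimal sketch, not the actual helper borrowed from server/server.go, and the function name is made up:

package example

import (
	"compress/gzip"
	"io"
	"net/http"
	"strings"
)

// readBody returns the raw request body, transparently gunzipping it when the
// client sent Content-Encoding: gzip.
func readBody(r *http.Request) ([]byte, error) {
	var src io.Reader = r.Body
	if strings.Contains(r.Header.Get("Content-Encoding"), "gzip") {
		gzr, err := gzip.NewReader(r.Body)
		if err != nil {
			return nil, err
		}
		defer gzr.Close()
		src = gzr
	}
	return io.ReadAll(src)
}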

Fixes: #6804

TODOs

netlify[bot] commented 3 months ago

Deploy Preview for openpolicyagent ready!

Latest commit: 0182db32a42f3f700245ebe48169775e838fbc20
Latest deploy log: https://app.netlify.com/sites/openpolicyagent/deploys/667c77b7948c3000083dbe38
Deploy Preview: https://deploy-preview-6825--openpolicyagent.netlify.app

srenatus commented 3 months ago

I'm a bit surprised about the approach taken here πŸ˜… I'd have expected us to change the ordering of handlers, since we’re setting up the authzHandler (via initRouters) before the compressHandler. One could argue that authz should happen before decompress (to avoid an attack vector, decompressing not-yet-authorized bodies), but if we end up decompressing in the authz handler anyways, that point is moot.

πŸ€” Am I missing something that makes this idea flawed? πŸ˜ƒ

srenatus commented 3 months ago

Ooooh that linked code (CompressHandler) only takes care of compressing the response, doesn't it? That's not what we need, of course.

I guess the alternative approach I had in mind would be to have some sort of middleware for both request payloads and response payloads. 💭
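
A request-decompression middleware along those lines might look roughly like the sketch below; this is an illustrative stand-in, not OPA's actual handler code, and where it would sit in the chain relative to authn/authz is exactly the open question above:

package example

import (
	"compress/gzip"
	"net/http"
	"strings"
)

// decompressRequest wraps a handler so that gzipped request bodies are
// transparently decompressed before the next handler reads them.
func decompressRequest(next http.Handler) http.Handler {
	return http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		if strings.Contains(r.Header.Get("Content-Encoding"), "gzip") {
			gzr, err := gzip.NewReader(r.Body)
			if err != nil {
				http.Error(w, "could not decompress the body", http.StatusBadRequest)
				return
			}
			defer gzr.Close()
			r.Body = gzr
			r.Header.Del("Content-Encoding")
		}
		next.ServeHTTP(w, r)
	})
}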

philipaconrad commented 3 months ago

@srenatus, I do think you've got a valid point w.r.t. the gzipped not-authorized-yet message bodies. I like the idea of having a uniform bit of middleware for decompressing requests! I'll take a look at how hard that would be to implement, and will ping back in this thread with the results.

srenatus commented 3 months ago

If we can keep the order authentication -> compression -> authorization, so that authentication comes first, then I think it's no problem at all.

philipaconrad commented 3 months ago

Gotcha! Authentication first should not be an issue; it's authorization and decompression that have a fixed ordering relative to each other.

philipaconrad commented 3 months ago

I think we actually could do some validation on gzip payload reads, but the method I have in mind relies on low-level knowledge of the format: the 8-byte trailer at the end of any valid gzip-compressed blob includes the CRC-32 of the uncompressed data, as well as the decompressed size in bytes, modulo 2^32 (4GB).
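
For reference, that trailer can be read directly from the end of a buffered gzip blob; here is a minimal sketch with a hypothetical helper name, assuming the whole gzip member is already in memory:

package example

import (
	"encoding/binary"
	"errors"
)

// gzipUncompressedSize returns the ISIZE field from a gzip member's 8-byte
// trailer: the size of the original input in bytes, modulo 2^32.
func gzipUncompressedSize(blob []byte) (uint32, error) {
	// A 10-byte header plus an 8-byte trailer is the bare minimum for a gzip member.
	if len(blob) < 18 {
		return 0, errors.New("gzip blob too short")
	}
	// Trailer layout (RFC 1952): CRC-32 (4 bytes), then ISIZE (4 bytes), both little-endian.
	return binary.LittleEndian.Uint32(blob[len(blob)-4:]), nil
}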

Here's a sketch of what a server/decoding plugin might look like; it would solve this problem a few levels above where the current PR's fix operates:

Maybe I'm overthinking things? I do think this approach would both boost performance for gzipped requests and (if my understanding of the compress/gzip package is correct) comprehensively resolve issue #6804, even for unauthenticated-but-authorized OPA deployments, at the expense of much heavier refactoring than I expected at the outset of trying to solve this issue. :sweat_smile:

Notes:

philipaconrad commented 3 months ago

@srenatus I've added some gzip + authz policy test cases, so that's no longer a blocker. The tests have some extra cruft in them to allow turning the authz policy parts on/off while I experiment with the server/decoding plugin idea.

johanfylling commented 3 months ago

On this branch, if I have no authz policy and run:

echo -n '{"input": {}}' | curlie --data @- http://127.0.0.1:8181/v1/data

I get a 200 OK and the expected eval result. But if I run:

echo -n '{"input": {}}' | gzip | curlie -H "Content-Encoding: gzip" --data-binary @- http://127.0.0.1:8181/v1/data

I get:

HTTP/1.1 400 Bad Request
Content-Type: application/json
Vary: Accept-Encoding
Date: Tue, 18 Jun 2024 09:10:31 GMT
Content-Length: 96

{
    "code": "invalid_parameter",
    "message": "could not decompress the body: unexpected EOF"
}

Given that I don't do authn/authz on the request, it's not surprising if the changes in this PR aren't exercised, but wouldn't we expect the latter request to succeed?

philipaconrad commented 3 months ago

[...] But if I run:

echo -n '{"input": {}}' | gzip | curlie -H "Content-Encoding: gzip" --data-binary @- http://127.0.0.1:8181/v1/data

@johanfylling I tried running that command with plain curl, and got a 200 OK on both main, and this PR branch:

echo -n '{"input": {}}' | gzip | curl -H "Content-Encoding: gzip" --data-binary @- http://127.0.0.1:8181/v1/data

:arrow_down: Response:

{"result":{}}

I wonder if it's a behavior difference or a bug in curlie?

johanfylling commented 3 months ago

@philipaconrad, you're correct: curl works as expected. This must be a bug in curlie, which is supposed to be just a simple wrapper around curl for prettier output... apparently they mess something up along the way 😄.

philipaconrad commented 3 months ago

In tests with a 1GB gzipped input sent via curl, it looks like we're decompressing twice in some cases: with this change, the authorizer unmarshals the input, and then the v1CompilePost endpoint unmarshals it again.

Other endpoints check the request context to see if the input was already parsed for them, but v1CompilePost does not. Since we probably do not want to have 2x the overhead for POST requests against /v1/data, I'm going to investigate adding a context lookup for that handler.
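
The context-lookup pattern mentioned above looks roughly like the sketch below; the names here are hypothetical, not the actual helpers in the codebase. The idea is that the authorizer stashes the parsed input on the request context, and the endpoint handler reuses it instead of decompressing and parsing a second time.

package example

import (
	"context"
	"net/http"
)

// parsedInputKey is the context key under which the authorizer stores the
// already-parsed request body.
type parsedInputKey struct{}

// withParsedInput returns a copy of the request whose context carries the parsed body.
func withParsedInput(r *http.Request, parsed interface{}) *http.Request {
	return r.WithContext(context.WithValue(r.Context(), parsedInputKey{}, parsed))
}

// parsedInput retrieves the parsed body, if an earlier handler stored one.
func parsedInput(r *http.Request) (interface{}, bool) {
	v := r.Context().Value(parsedInputKey{})
	return v, v != nil
}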

geninput.py:

# Generate a ~1GB JSON document: about a million keys, each with a 1KB string value.
N = 1024 * 1024
oneKBString = "A" * 1024

if __name__ == "__main__":
    print('{"input":{')
    for i in range(0, N):
        print('"{}": "{}",'.format(i, oneKBString))
    print('"{}": "{}"'.format(N, oneKBString))
    print("}}")

Example CLI (server):

/usr/bin/time -v ./opa-pr run -s --authorization=basic authz.rego

Example CLI (client):

python3 geninput.py | gzip | curl -H "Content-Encoding: gzip" --data-binary @- http://127.0.0.1:8181/v1/data

philipaconrad commented 3 months ago

:information_source: Here's the current state of things (for @johanfylling, @srenatus, and anyone else following along):

The good news:

The bad news:

Unless there are any other comments, critiques, or requests around this PR, I think it should be good to merge now.