Open crepererum opened 1 day ago
Interesting, so the gRPC server will return an empty body with a 200 status code, which coincidentally is a valid S3 response?
S3 responses for GET
operations are just the pure data, there's NO wrapping of the bytes into any object like JSON or XML. All metadata is transmitted via headers, which our client doesn't need (also to support alternative S3 implementations more easily).
IMO this isn't something we should look to handle, if you point it at an HTTP server that returns 200 OK, how is it to know that this is wrong? Sure we could handle grpc-status, but it seems rather odd to me to be inspecting a protocol specific header.
Describe the bug :warning: I understand that this is a very niche issue, but I thought I share this trap with others.
If you accidentally point an S3
object_store
client to an gRPC endpoint, it will happily read empty objects for most paths (i.e. all paths that are not covered by the gRPC endpoint). This can become quite a debug nightmare.To Reproduce Set up a gRPC server, e.g. w/
tonic
. The code example for the client sets this up to port1234
.Then configure an S3 Client to point at it. It's important that you
Expected behavior I was naively expecting the client to error.
Additional context gRPC for some bizarre reasons decides to not use the HTTP status code at all but instead a custom response header
grpc-status
. In our case, this is set to12
forUNIMPLEMENTED
, see https://grpc.github.io/grpc/core/md_doc_statuscodes.html .The response body for
UNIMPLEMENTED
is empty. Thecontent-length
response header is set to0
(that's required by theobject_store
client).:arrow_right: So I think what we could do as some kind of safeguard would be to check the
grpc-status
response header and bail out if it is set.