I managed to bypass the issue by modifying the upstream object_store crate to use x-goog-stored-content-length, but not sure if that's the correct way to go about it
let content_length = headers
.get(CONTENT_LENGTH)
.or_else(|| headers.get("x-goog-stored-content-length"))
.context(MissingContentLengthSnafu)?;
Is there something in Arroyo that can handle this?
Came across a weird issue trying to read gzipped NDJSON files in GCS -- the pipeline fails with:
{"message":"Error: Generic GCS error: Header: Content-Length Header missing from response. Retrying..."},"target":"arroyo_storage"}
Sending a HEAD request to the actual object shows that the Content-Length indeed is missing
I managed to bypass the issue by modifying the upstream object_store crate to use x-goog-stored-content-length, but not sure if that's the correct way to go about it
Is there something in Arroyo that can handle this?
(Could be potentially be a similar issue as https://github.com/apache/libcloud/issues/1544)