vectordotdev / vector

A high-performance observability data pipeline.
https://vector.dev
Mozilla Public License 2.0

Support `zstd` compression in the `file` source #16891

Open zamazan4ik opened 1 year ago

zamazan4ik commented 1 year ago


Use Cases

Right now Vector's `file` source can read compressed files only when they are compressed with GZip (https://vector.dev/docs/reference/configuration/sources/file/#compressed-files). But other algorithms, such as Zstd or Snappy, are also used nowadays to store compressed logs.

Attempted Solutions

I tried unpacking the logs externally before passing them to Vector, but that is too inconvenient a solution.
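For illustration, the external-unpacking workaround could look roughly like the sketch below. It is only a sketch under stated assumptions: the helper name `unpack_log` and the directory layout are hypothetical, the Zstd branch assumes the `zstd` command-line tool is installed and on `PATH`, and nothing here is part of Vector itself. The idea is to write plain-text copies into a directory that a `file` source is configured to watch.

```python
import bz2
import shutil
import subprocess
from pathlib import Path


def unpack_log(src: Path, out_dir: Path) -> Path:
    """Decompress `src` into `out_dir` and return the path of the plain copy.

    Hypothetical helper: the format is chosen by file extension only.
    """
    out_dir.mkdir(parents=True, exist_ok=True)
    dest = out_dir / src.stem  # "app.log.zst" -> "app.log"
    # Write under a temporary name first, so a glob such as "*.log"
    # does not match a half-written file.
    tmp = dest.with_name(dest.name + ".partial")

    if src.suffix == ".zst":
        # Assumption: the `zstd` CLI is available.
        # -d decompresses, -c writes the result to stdout.
        with tmp.open("wb") as out:
            subprocess.run(["zstd", "-d", "-c", str(src)], stdout=out, check=True)
    elif src.suffix == ".bz2":
        # bzip2 is handled by the Python standard library.
        with bz2.open(src, "rb") as inp, tmp.open("wb") as out:
            shutil.copyfileobj(inp, out)
    else:
        raise ValueError(f"unsupported extension: {src.suffix}")

    tmp.replace(dest)  # rename to the final name once the copy is complete
    return dest
```

Something like `unpack_log(Path("/var/log/archive/app.log.zst"), Path("/var/log/unpacked"))` would then leave `/var/log/unpacked/app.log` for the `file` source to pick up (both paths are made up for the example). The extra copy on disk and the extra moving part are exactly what makes this approach inconvenient.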

Proposal

Add support for zstd compression.

References

No response

Version

No response

jszwedko commented 1 year ago

@zamazan4ik do you have a specific compression algorithm you are interested in seeing support for? I think we should have one issue per algorithm.

zamazan4ik commented 1 year ago

I'm especially interested in the following algorithms:

Feel free to create an issue per requested algorithm :)

jszwedko commented 1 year ago

Thanks @zamazan4ik! I'll repurpose this one for zstd since I think that is likely to be the most popular. We can let users open issues for the others with their use cases.

zamazan4ik commented 1 year ago

In my experience, in addition to GZip, I've seen logs stored in files in Zstd and Bzip formats. I've never seen logs in Snappy, but it could be useful to support it as well.
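As a side note, the formats mentioned here can usually be told apart by the first few bytes of a file. The snippet below is a minimal sketch of that idea, not a description of how Vector detects anything: `sniff_compression` and the `MAGIC` table are hypothetical names, and the Snappy entry covers only the Snappy framing format (raw Snappy blocks have no signature).

```python
from typing import Optional

# Leading "magic" bytes of each format.
MAGIC = {
    "gzip": b"\x1f\x8b",
    "zstd": b"\x28\xb5\x2f\xfd",  # standard Zstandard frame, not skippable frames
    "bzip2": b"BZh",
    "snappy": b"\xff\x06\x00\x00sNaPpY",  # Snappy framing format stream identifier
}


def sniff_compression(header: bytes) -> Optional[str]:
    """Return the format name for the leading bytes of a file, or None."""
    for name, magic in MAGIC.items():
        if header.startswith(magic):
            return name
    return None
```

Reading the first 16 bytes of a file and passing them to `sniff_compression` is enough for all four signatures; a `None` result means the file is either plain text or in a format not listed in the table.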

dermetfan commented 1 year ago

This looks like a duplicate of #13500

jszwedko commented 1 year ago

Agreed, I'll close the other one since this is more focused. Thanks for flagging @dermetfan

mutexed commented 7 months ago

Hello, are there any updates on this ticket? I'm interested in zstd compression for the file source. Maybe there is some workaround?

@jszwedko

kvlas commented 3 months ago

Any update? I'm also interested in zstd for the file source.