Open ppf2 opened 7 years ago
Looking at the Rally code, it looks like we have since implemented support for decompressing gz files so this specific use case no longer applies. Will leave this issue open in case users out there can benefit from additional compression options in this input.
bzip2 is around 27% better compression of log files in my testing. Testing Logstash output I also see a 3.5% improvement using gzip --best bzip2 is MUCH slower of course which might be an issue. lzma is 45% better
it is possible that I am seeing the improvement due to the external utility's ability to chose different block size compared to the stream encoding.
I would like to see bz2 and lzma as a codec option also.
Currently, this plugin supports output to gzip, but not tarball gzip or zip or bzip2. It will be nice to provide additional compression options. Example use case here is use LS to get subset of documents from ES, scrub and transform the data, and output the documents to a file to feed into Rally for testing. Rally's documents setting currently supports only tar.gz, bz2 or zip files. Certainly, the end user can do post processing to convert the file themselves, so this is an ER for convenience.