apache / orc

Apache ORC - the smallest, fastest columnar storage for Hadoop workloads
https://orc.apache.org/
Apache License 2.0
665 stars 477 forks source link

ORC-1708: Support data/compress options in Hive benchmark #1924

Closed cxzl25 closed 1 month ago

cxzl25 commented 1 month ago

What changes were proposed in this pull request?

This PR aims to support data/compress options in Hive benchmark.

Why are the changes needed?

Easy to do sampling tests in Hive benchmark.

How was this patch tested?

local test

java -jar hive/target/orc-benchmarks-hive-*-uber.jar read-all data  -data taxi -compress snappy

Was this patch authored or co-authored using generative AI tooling?

No

dongjoon-hyun commented 1 month ago

cc @williamhyun