apache / orc

Apache ORC - the smallest, fastest columnar storage for Hadoop workloads
https://orc.apache.org/
Apache License 2.0
689 stars 483 forks source link

ORC-1571: Supports displaying raw data size in the `meta` command of orc-tools #1726

Closed cxzl25 closed 10 months ago

cxzl25 commented 10 months ago

What changes were proposed in this pull request?

Display raw data size in meta command

Why are the changes needed?

We can directly see the compressed orc file size and uncompressed data size. Like parquet-cli

Row group 0:  count: 1000  210.95 B records  start: 4  total(compressed): 206.006 kB total(uncompressed):10.733 MB

How was this patch tested?

UT