Closed letian-jiang closed 11 months ago
Would you mind first create an issue like: https://issues.apache.org/jira/browse/PARQUET-2299 or use MINOR?
There is no padding between values (except for the last byte) which is padded with 0s.
This change looks good to me.
Dictionary page format: the entries in the dictionary - in dictionary order - using the plain encoding.
Does this means value in data page is same as position in dictionary page? 🤔
Also cc @wgtmac @gszadovszky
Would you mind first create an issue like: https://issues.apache.org/jira/browse/PARQUET-2299 or use MINOR?
I will create a related issue once my JIRA account request is approved.
Does this means value in data page is same as position in dictionary page? 🤔
I think so. The data page contains dictionary code (i.e. offset in dictionary page )
Made some updates. Please take another look. @mapleFU @gszadovszky @JFinis
The dictionary entries are not sorted (or at least not always sorted).
Minor change.
Jira
Commits