Open EnricoMi opened 1 week ago
:warning: GitHub issue #34698 has been automatically assigned in GitHub to PR creator.
@raulcd thanks, assertions adjusted
Rationale for this change
An Acero node that turns an implicit ordering into an explicit ordering (rows sorted by some columns) is useful for reusing an order that already exists in the data.
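To make the idea concrete, here is a minimal, library-free sketch of what "turning an implicit ordering into an explicit one" can look like: a pass-through step that is told which sort keys the incoming rows already follow. It is not the code in this PR and does not use any Arrow API. The names `SortKey` and `assert_order` are hypothetical, and the sketch assumes the step verifies the claimed order as rows flow through; whether the node in this PR checks rows or only declares the ordering is not stated here.

```python
from dataclasses import dataclass
from typing import Iterable, Iterator, Sequence


@dataclass(frozen=True)
class SortKey:
    """Hypothetical stand-in for a sort key: a column name plus a direction."""
    column: str
    ascending: bool = True


def _precedes_or_equals(prev: dict, curr: dict, keys: Sequence[SortKey]) -> bool:
    """Return True if `prev` may come before `curr` under the claimed ordering."""
    for key in keys:
        a, b = prev[key.column], curr[key.column]
        if a == b:
            continue  # tie on this key, so the next key decides
        return a < b if key.ascending else a > b
    return True  # equal on every key


def assert_order(rows: Iterable[dict], keys: Sequence[SortKey]) -> Iterator[dict]:
    """Pass rows through unchanged, raising if they violate the claimed ordering."""
    prev = None
    for index, row in enumerate(rows):
        if prev is not None and not _precedes_or_equals(prev, row, keys):
            raise ValueError(f"row {index} breaks the asserted ordering")
        yield row
        prev = row


# Rows that are already sorted by id ascending, then ts descending.
rows = [{"id": 1, "ts": 5}, {"id": 1, "ts": 3}, {"id": 2, "ts": 9}]
ordering = [SortKey("id"), SortKey("ts", ascending=False)]
checked = list(assert_order(rows, ordering))  # no sort is performed
```

The point of the design is that no sorting happens: the rows are only compared pairwise in a single pass, so a downstream consumer that needs sorted input can rely on the declared keys instead of paying for a full sort.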
What changes are included in this PR?
This PR adds the `AssertOrderNode`, which implements this logic. The `Scanner` employs that node to turn the implicit ordering of the `ScanNode` into an explicit order, as defined by user code via `ScannerBuilder.Ordering`.
Are these changes tested?
There are unit tests for the `AssertOrderNode` as well as for the `ScanNode` and `ScannerBuilder`.
Are there any user-facing changes?
The following have been added:
- the `ordering` option of `ScanOptions`
- the `Ordering` option of `ScannerBuilder`
- the `AssertNodeOptions` class
GitHub Issue: #34698