apache / datafusion

Apache DataFusion SQL Query Engine
https://datafusion.apache.org/
Apache License 2.0
5.21k stars 957 forks source link

Sort Merge Join. LeftSemi issues #10379

Open comphead opened 2 weeks ago

comphead commented 2 weeks ago

Describe the bug

The ticket is umbrella for Sort Merge Join LeftSemi bugs. Currently Sort Merge Join LeftSemi crashes or gives the wrong result.

To Reproduce

To reproduce it is enough to run Q21 TPCH bench in bench.sh

 RUST_BACKTRACE=1 RESULTS_NAME=smj ./benchmarks/bench.sh run tpch_smj

Or just Q21

    $CARGO_COMMAND --bin tpch -- benchmark datafusion --iterations 5 --query 21 --path "${TPCH_DIR}" --prefer_hash_join false --format parquet -o ${RESULTS_FILE}

Expected behavior

No response

Additional context

No response