Open Dandandan opened 2 months ago
There is a proposal in DataFusion to add a hook for sharing the join state: https://github.com/apache/datafusion/pull/12523
We tested this at Coralogix, and it works very well for us.
It could be disabled as well, although that will likely hurt performance by quite a bit.
Should we adopt this once it gets merged in DataFusion?
Describe the bug
See the discussion here: https://github.com/apache/datafusion/issues/12454
The "broadcast join" (CollectLeft) produces wrong results for join types that emit unmatched left rows.
To Reproduce
Run a broadcast join with a left or full outer join type on more than one node.
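To make the failure mode concrete, here is a toy model in plain Python. It is not DataFusion or Ballista code; every function name in it is hypothetical, and it assumes the problem is that each node tracks "which left rows matched" only for its own slice of the right side. The first function emits unmatched left rows per node, the second merges the matched flags across nodes before emitting them, which is roughly the idea behind sharing the join state.

```python
def local_left_outer(left, right_partition):
    """Join the full (broadcast) left side against one node's right partition.

    Returns the matched rows plus a per-left-row flag saying whether that
    row found a match on this node.
    """
    matched = [False] * len(left)
    rows = []
    for i, l in enumerate(left):
        for r in right_partition:
            if l == r:
                rows.append((l, r))
                matched[i] = True
    return rows, matched


def naive_broadcast_join(left, right_partitions):
    """Each node emits its own unmatched left rows (the buggy behaviour)."""
    out = []
    for part in right_partitions:
        rows, matched = local_left_outer(left, part)
        out.extend(rows)
        # A row unmatched on this node may still have matched elsewhere.
        out.extend((l, None) for l, m in zip(left, matched) if not m)
    return out


def shared_state_broadcast_join(left, right_partitions):
    """Merge the matched flags of all nodes, then emit unmatched rows once."""
    out = []
    global_matched = [False] * len(left)
    for part in right_partitions:
        rows, matched = local_left_outer(left, part)
        out.extend(rows)
        global_matched = [g or m for g, m in zip(global_matched, matched)]
    out.extend((l, None) for l, g in zip(left, global_matched) if not g)
    return out


left = [1, 2, 3]
right_partitions = [[1], [2]]  # one right partition per node

naive = naive_broadcast_join(left, right_partitions)
shared = shared_state_broadcast_join(left, right_partitions)
print("naive :", naive)
print("shared:", shared)
```

In this model the naive version reports rows 1 and 2 as unmatched even though each matched on the other node, and it reports row 3 twice. The shared-state version emits each unmatched left row exactly once.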