Is your feature request related to a problem? Please describe.
For large-scale anti-joins, we can speed it up by not performing an expensive repartition on both sides. Since the LHS is very heavy (keys and values), but the RHS is lightweight (keys only), it can be much cheaper to not perform any repartitioning and just do an all-to-all join by performing a local anti-join for each LHS partition, for each RHS partition.
Is your feature request related to a problem? Please describe.
For large-scale anti-joins, we can speed it up by not performing an expensive repartition on both sides. Since the LHS is very heavy (keys and values), but the RHS is lightweight (keys only), it can be much cheaper to not perform any repartitioning and just do an all-to-all join by performing a local anti-join for each LHS partition, for each RHS partition.