Open j7nhai opened 1 week ago
Did you enable sort based shuffle? Hash shuffle has this issue.
Did you enable sort based shuffle? Hash shuffle has this issue.
Is it a bug and will be fixed in the future? May I know the reason why the disk is rising?
I didn't see any config to force enable sort based shuffle to avoid hash shuffle. Could I just decrease the value of spark.gluten.sql.columnar.shuffle.sort.columns.threshold
It's the design of hash shuffle and one reason we implemented the sort shuffle.
You may decrease the two, first one is the threshold of reducer#, second one is the threshold of column#.
spark.gluten.sql.columnar.shuffle.sort.partitions.threshold
spark.gluten.sql.columnar.shuffle.sort.columns.threshold
Backend
VL (Velox)
Bug description
When run ssb-q4.2 with scale 100T and enable columnar shuffle writes, we found that shuffle write byte added up of all stages increase as the number of partitions increases. However, when disable gluten, the growth trend of vanilla spark is not so obvious.
The following table shows the shuffle write bytes sum by all stages.
Spark version
None
Spark configurations
No response
System information
No response
Relevant logs
No response