Open anhnongdan opened 6 years ago
thangnguyen_backtest_oneclick_20190629 14_48 - Details for Stage 8104 (Attempt 0).pdf
This is an example for bad coalescing, a small file is calculated from some giant files and then the cluster stuffed all task into one single node!
Update in 2019-01-03
Never use coalesce with complex calculation (join, aggregate), especially with small number of partitions (<30). => Small num_part limit the parallelization of the tasks and spill out the memory.