data-derp / small-exercises

6 stars 9 forks source link

[Content] [Wrangling in Spark] material 02 Setting Spark Connection - how to choose number of shuffle partitions #15

Closed kelseymok closed 1 year ago

kelseymok commented 1 year ago

Setting Spark Connection from https://github.com/data-derp/small-exercises/blob/master/wrangling-in-spark/wrangling-in-spark.dbc

Maybe add a cell on considerations of how to best choose the number of shuffle partitions? Like https://sparkbyexamples.com/spark/spark-shuffle-partitions/