aylabs / bigdata-practical-intro

A practical intro to Big Data based on Apache Spark
Apache License 2.0
1 stars 0 forks source link

The importance of partitioning #8

Open acs opened 5 years ago

acs commented 5 years ago

It is key to understand why data partitioning is needed and its implications.

There are great resources about it like:

https://medium.com/parrot-prediction/partitioning-in-apache-spark-8134ad840b0