AlexIoannides / pyspark-example-project

Implementing best practices for PySpark ETL jobs and applications.
1.56k stars 672 forks source link

New best practices #1

Closed AlexIoannides closed 6 years ago

AlexIoannides commented 6 years ago

Updated to include updates to what I consider to be 'best practices' for ETL using Apache Spark.