Apress / learn-pyspark

Source Code for 'Learn PySpark' by Pramod Singh
Other
25 stars 43 forks source link

Data (csv files) for Chapter 5 are missing? #1

Open hanytran opened 4 years ago

hanytran commented 4 years ago

There is not any data files in chapter 5. The current one is only for classification part. Please check.

peterhaglich commented 4 years ago

I thought of trying to create data frames for those but instead I used the classification data and selected columns to walk through the correlation, encoding, and chi-square test portions. The answers won't be the same as the text, obviously. Still, you should be able to see the formatted output.