Closed tharunsikhinam closed 5 years ago
1) store a subset of raw data in the repo 2) run amazon and goodreads scripts to generate cleaned data and dump to mongodb 3) test cases in spark
data folder added and also added test data
1) store a subset of raw data in the repo 2) run amazon and goodreads scripts to generate cleaned data and dump to mongodb 3) test cases in spark