Closed (divayjindal95 closed 4 years ago)
Sorry for the late reply - I've only just seen this for some reason.
If you haven't already figured it out, the problem is that you can't package scikit-learn in this way - you'll have to install it manually on every node of the cluster, because it has a dependency on NumPy, which relies on compiled C code that has to be built or installed locally on each node...
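For anyone hitting the same thing, here is a rough way to check whether a dependency is safe to ship as plain Python files or needs a proper install on each node. It is only a sketch: `has_compiled_extensions` is a helper name I made up for illustration (it is not part of this project or of PySpark), and it assumes the package is installed in the environment where you run the check.

```python
import importlib.machinery
import importlib.util
from pathlib import Path


def has_compiled_extensions(package_name):
    """Return True if the installed package contains compiled extension modules.

    Hypothetical helper for illustration only. It looks for files whose names
    end in one of the interpreter's extension-module suffixes (for example
    ".so" on Linux or ".pyd" on Windows).
    """
    spec = importlib.util.find_spec(package_name)
    if spec is None:
        raise ImportError(f"{package_name!r} is not installed in this environment")

    suffixes = tuple(importlib.machinery.EXTENSION_SUFFIXES)

    # A single-file module has no search locations; inspect its origin instead.
    if spec.submodule_search_locations is None:
        return bool(spec.origin) and spec.origin.endswith(suffixes)

    # A package: walk every directory it occupies and look for compiled files.
    for location in spec.submodule_search_locations:
        for path in Path(location).rglob("*"):
            if path.name.endswith(suffixes):
                return True
    return False


if __name__ == "__main__":
    # "json" is used here only because it is always available.
    for name in ("json",):
        kind = "compiled extensions" if has_compiled_extensions(name) else "pure Python"
        print(f"{name}: {kind}")
```

If this returns True for a package (as I would expect for sklearn and numpy wherever they are installed), treat it as something to install on every node rather than something to bundle with the job.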
Hi, I am using your project as a reference for writing ETL apps with PySpark. My app just imports sklearn, and I am running spark-submit locally.
jobs/etl_job.py fails with the following error: