jpmml / pyspark2pmml

Python library for converting Apache Spark ML pipelines to PMML
GNU Affero General Public License v3.0

installation method on spark cluster #19

Closed by yairdata 5 years ago

yairdata commented 5 years ago

Hi, a question about the installation method. The jpmml-sparkml JAR can be distributed to all workers because we use the --jars flag, but does the pyspark2pmml wrapper need to be installed only on the edge node, or on all the workers as well?

vruusmann commented 5 years ago

I have no idea.

Moreover, I think that this question ("how do I distribute a Python wrapper for an Apache Spark package") really belongs on the Apache Spark mailing list.