jpmml / pyspark2pmml

Python library for converting Apache Spark ML pipelines to PMML
GNU Affero General Public License v3.0

Read in model without spark context #36

Closed DuongVu39 closed 2 years ago

DuongVu39 commented 2 years ago

I created a SparkML pipeline and saved it out as instructed in the README example. When reading the PMML model back in, will I be able to do so without a spark context?

The example from README.md requires the spark context, as shown below:

from pyspark2pmml import PMMLBuilder

classifierModel = pipelineModel.stages[1]

pmmlBuilder = PMMLBuilder(sc, df, pipelineModel) \
    .putOption(classifierModel, "compact", False) \
    .putOption(classifierModel, "estimate_featureImportances", True)

pmmlBuilder.buildFile("DecisionTreeIris.pmml")
vruusmann commented 2 years ago

When reading the pmml model in, will I be able to do so without spark context?

The spark context is typically needed for loading the pipeline object from file into memory:

SparkSession sparkSession = ...;

MLReader<PipelineModel> mlReader = new PipelineModel.PipelineModelReader();
// THIS!
mlReader.session(sparkSession);

PipelineModel pipelineModel = mlReader.load(tmpPipelineDir.getAbsolutePath());

After that, the underlying Java converter component org.jpmml.sparkml.PMMLBuilder has no further use for it.

The example from README.md requires the spark context, as shown below

This example is about using JPMML-SparkML in a PySpark environment. If you check the source code of the pyspark2pmml.PMMLBuilder class, you can see that the spark context is used only for obtaining a handle to the active JVM runtime (sc._jvm).

TLDR: Figure out your exact application scenario - are you working in Java/Scala, or Python/PySpark? Do you already have an Apache Spark instance running somewhere at the time of conversion?

You can always start a local/temporary spark context if nothing else works.