nchammas / flintrock

A command-line tool for launching Apache Spark clusters.
Apache License 2.0

Ensure `python3` executable is available on cluster #334

nchammas closed this issue 3 years ago

nchammas commented 3 years ago

It looks like, starting in Spark 3.1 (?), calling `pyspark` specifically looks for a `python3` executable. Flintrock should ensure this executable is available on every node of the cluster.
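As a rough illustration of the check being asked for, here is a minimal sketch that verifies a `python3` executable is on the `PATH` of the machine it runs on. The function name `ensure_python3` and its injectable `which` parameter are hypothetical and are not part of Flintrock. How Flintrock would actually run such a check on remote nodes is not covered here.

```python
import shutil
from typing import Callable, Optional


def ensure_python3(which: Callable[[str], Optional[str]] = shutil.which) -> str:
    """Return the path to python3, or raise if it is not on PATH.

    `which` is injectable (an assumption made for testability); by default
    it is the standard library's shutil.which.
    """
    path = which("python3")
    if path is None:
        raise RuntimeError("python3 executable not found on PATH")
    return path
```

Calling `ensure_python3()` with no arguments checks the local `PATH`; passing a stand-in for `which` makes the behaviour easy to test without touching the real environment.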

It's clear that the FlintrockService abstraction needs to be reused to capture things like Java and Python configuration, instead of doing that in a haphazard way. I'll explore that in a future PR (unless a contributor happens to take interest in this issue).
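One way to picture the idea of handling Java and Python setup through a shared abstraction is sketched below. This is purely illustrative: `PackageService`, its fields, and its methods are hypothetical names, not the real `FlintrockService` interface, and the use of `yum` as the package manager is an assumption about the cluster's operating system.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PackageService:
    """Hypothetical description of one executable a cluster node must have."""

    executable: str  # command that must be on PATH, e.g. "python3"
    package: str     # package that provides it (assumed name)

    def check_command(self) -> str:
        # POSIX `command -v` exits non-zero when the executable is missing.
        return f"command -v {self.executable} >/dev/null 2>&1"

    def install_command(self) -> str:
        # Assumption: nodes use yum; another distro would need another command.
        return f"sudo yum install -y {self.package}"

    def provision_command(self) -> str:
        # Install only if the check fails.
        return f"{self.check_command()} || {self.install_command()}"


# Python configuration expressed through the same hypothetical abstraction
# that could also describe Java or other dependencies.
python3_service = PackageService(executable="python3", package="python3")
```

The point of the sketch is only that each dependency declares its own check and install step in one place, so the provisioning logic can loop over a list of services instead of special-casing each one.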