Closed blairdrummond closed 4 years ago
@ca-scribner
Successfully connected to a path 2 databricks cluster using databricks-connect and info about the databricks instance and a live cluster. Can run pyspark commands from a local pycharm, etc., using cluster for compute. Not sure how to translate this to using pyspark from a notebook server in kubernetes.
I don't think the databricks-connect method is the way - is a connection through vanilla pyspark appropriate? Steve's medium post goes that route but I couldn't fully reproduce it.
@sylus / @zachomedia any tips or past code fragments using pyspark? I only saw this in the repos
I still need to do some final tweaks and will let you know but this is roughly how I ported the actions one over and gave an example using kfp dsl.
Would be great to see databricks API integration and ideally pyspark examples.