[BUG] Databricks integration is not working as documented

0.9.72

According to the documentation, the pipeline type in metadata.yaml can be set to databricks, but in practice it cannot: the value always reverts to python.

As a result, a Spark DataFrame is not converted to a pandas DataFrame automatically:

df_spark = spark.sql("select * table") returns a Spark DataFrame, and returning it from the block raises a pickle error.

To make it work, you need to convert explicitly to pandas: df_spark = spark.sql("select * table").toPandas() works.
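
For reference, a minimal sketch of the workaround as a helper. The name to_picklable is hypothetical and not part of Mage; it assumes the block's return value only needs to be a pandas DataFrame, and it duck-types on the toPandas method so it can be read and run without pyspark installed.

```python
def to_picklable(df):
    """Return a pandas DataFrame if `df` looks like a Spark DataFrame.

    Hypothetical helper (not a Mage API): anything exposing a callable
    `toPandas` attribute is converted, everything else is returned unchanged.
    """
    to_pandas = getattr(df, "toPandas", None)
    if callable(to_pandas):
        # toPandas() collects the full result onto the driver,
        # so this is only reasonable for tables that fit in memory.
        return to_pandas()
    return df


# Intended use inside the data loader block (requires a live `spark` session):
#     df_spark = spark.sql("select * from dev.test.student")
#     return to_picklable(df_spark)
```

This only hides the explicit .toPandas() call; it does not address the pipeline type reverting to python.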

Run data loader (generic)

df_spark = spark.sql("select * from dev.test.student")
return df_spark

Support the databricks pipeline type as documented. Handle a Spark DataFrame as a valid return value.


Docker on macOS

No response

mage-ai / mage-ai