aw-west-defra closed 1 year ago
Sedona 1.4.1 has two spark-shaded jars for Scala 2.12: `sedona-spark-shaded-3.0_2.12-1.4.1.jar` and `sedona-spark-shaded-3.4_2.12-1.4.1.jar`. If you are using `sedona-spark-shaded-3.0_2.12-1.4.1.jar` in your Databricks cluster, you can try switching to `sedona-spark-shaded-3.4_2.12-1.4.1.jar`, which targets Spark 3.4.
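A quick way to sanity-check which jar a cluster should carry is to derive the expected jar name from the Spark version. The sketch below is only illustrative: the helper names `shaded_jar_name` and `jar_matches_spark` are hypothetical, and the mapping assumes that for Sedona 1.4.1 the `3.4` jar is for Spark 3.4 while the `3.0` jar covers Spark 3.0 through 3.3.

```python
import re

# Matches names like sedona-spark-shaded-3.4_2.12-1.4.1.jar
_JAR_RE = re.compile(r"^sedona-spark-shaded-(\d+\.\d+)_(\d+\.\d+)-(.+)\.jar$")


def shaded_jar_name(spark_version, scala_version="2.12", sedona_version="1.4.1"):
    """Return the spark-shaded jar name expected for a given Spark version.

    Assumption: Spark 3.4 uses the `3.4` jar and Spark 3.0 to 3.3 use the
    `3.0` jar. Any other version is rejected rather than guessed.
    """
    major, minor = (int(part) for part in spark_version.split(".")[:2])
    if major == 3 and minor == 4:
        target = "3.4"
    elif major == 3 and 0 <= minor <= 3:
        target = "3.0"
    else:
        raise ValueError(f"no known shaded jar for Spark {spark_version}")
    return f"sedona-spark-shaded-{target}_{scala_version}-{sedona_version}.jar"


def jar_matches_spark(jar_name, spark_version):
    """Return True if the installed jar is the one expected for this Spark."""
    match = _JAR_RE.match(jar_name)
    if match is None:
        raise ValueError(f"not a spark-shaded jar name: {jar_name}")
    _, scala_version, sedona_version = match.groups()
    return jar_name == shaded_jar_name(spark_version, scala_version, sedona_version)
```

On a cluster, `spark.version` can be passed as `spark_version`. With the setup described in this issue, `jar_matches_spark("sedona-spark-shaded-3.0_2.12-1.4.1.jar", "3.4.0")` returns `False`, which flags the mismatch.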
The GeoParquet reader also has a known problem: it does not work on Databricks with Photon enabled. If your cluster does not have Photon enabled, switching the spark-shaded jar should solve this problem.
Thank you! I was using the jar for Spark 3.0 instead of the one for Spark 3.4.
Expected behaviour
I want to read and save my data using GeoParquet. I can save, but not read.
Actual behaviour
```java
Py4JJavaError Traceback (most recent call last)
File
java.lang.NoSuchMethodException: java.lang.NoSuchMethodError: org.apache.spark.sql.internal.SQLConf.parquetFilterPushDownStringStartWith()Z
```
Steps to reproduce the problem
Register Sedona and get an example dataset.
Reading that dataset back in as GeoParquet fails.
But I can save as GeoParquet.
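The original notebook cells are not included here, so the following is only a rough sketch of what the three steps might look like, not the reporter's actual code. It assumes the Sedona 1.4.x Python API (`SedonaRegistrator.registerAll` and the `geoparquet` data source format). The helper names and any paths passed to them are hypothetical.

```python
def register_sedona(spark):
    """Register Sedona's types and SQL functions on an existing SparkSession."""
    # Imported lazily so this module can be loaded without apache-sedona installed.
    from sedona.register import SedonaRegistrator

    SedonaRegistrator.registerAll(spark)
    return spark


def read_geoparquet(spark, path):
    """Read a GeoParquet dataset; this is the step that failed in this issue."""
    return spark.read.format("geoparquet").load(path)


def write_geoparquet(df, path, mode="overwrite"):
    """Write a DataFrame with a geometry column as GeoParquet."""
    df.write.format("geoparquet").mode(mode).save(path)


# Example usage on a cluster (paths are placeholders, not from the issue):
# spark = register_sedona(spark)
# write_geoparquet(df, "dbfs:/tmp/example_geoparquet")
# df2 = read_geoparquet(spark, "dbfs:/tmp/example_geoparquet")
```

With a mismatched spark-shaded jar, the write can succeed while the read raises the `NoSuchMethodError` reported under Actual behaviour.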
Am I missing out on bbox and indexing performance by not using GeoParquet?
Settings
Sedona version = 1.4.1
Apache Spark version = 3.4.0
API type = Python
Scala version = 2.12
JRE version = 1.8
Python version = 3.10
Environment = Azure Databricks - DBR 13.2
I don't believe it is due to my config/init script. I have previously failed to use GeoParquet on DBR 9 LTS and 12 LTS. Would it help to recreate those errors?