Closed jacob-talroo closed 5 months ago
The geometry
type data will not be able to be written to delta lake because delta lake has no geometry
type.
It was not supposed to work actually. The correct way to save a geometry column to Delta lake is ST_EWKB
ST_EWKT
and read it back using ST_GeomFromWKB
and ST_GeomFromWKT
Oops. I had a bad test result. I think the issue is DBR 15.3 Beta. DBR 15.2 appears to be working. I'll update the title/description above.
I will be surprised if this is working. Not sure how it could work in the first place
I can confirm that this IS a change on DBR 15.3 Beta. I'll file a defect with Databricks.
@Kontinuation seems like the Databricks DeltaLake can read / write Sedona Geometry?
I couldn't write the geometry --- so I am writing the WKB in < DBR 15.2.
However, in DBR 15.3, all writes fail.
FYI - Databricks provided the following workaround - remove spark.serializer org.apache.spark.serializer.KryoSerializer
from the cluster config.
I believe they are working on a real fix now.
Expected behavior
Writing out a delta lake should work.
Actual behavior
Fails with:
Steps to reproduce the problem
Launch a DBR 15.3 Beta with Sedona. Run:
I am hopeful that the stack trace might be helpful to someone.
Settings
Sedona version = 1.5.3 and 1.60
Apache Spark version = 3.5
API type = Scala
Scala version = 2.12
JRE version = 17
Python version = 3.11.0
Environment = Databricks 15.3 beta