s22s / pre-lt-raster-frames

Spark DataFrames for earth observation data
http://rasterframes.io
Other
19 stars 5 forks source link

Add PySpark support #17

Open metasim opened 7 years ago

metasim commented 7 years ago

Currently the UDTs work fine in PySpark, and UDFs if called from SQL context. Need to figure out the Python constructs and configuration necessary to register UDFs and UDAFs with the Python DataFrame API.

Support will need to be done in two phases. The first addressing the UDF/UDAF registration/declaration in the DataFrame context. The second is to figure out how to declare and enforce the RasterFrame constraints around spatial key, tile layer metadata, and tiles. This is likely much harder without a type system, perhaps requiring custom Python wrappers.

metasim commented 7 years ago

WIP: https://github.com/s22s/raster-frames/tree/feature/python-api