zero-one-group / geni

A Clojure dataframe library that runs on Spark
Apache License 2.0
281 stars 28 forks source link

Support Spark UDF #306

Open andres-moreno opened 3 years ago

andres-moreno commented 3 years ago

Geni users would benefit from support for Spark User Defined Functions on dataframes as documented here.

UDFs are very useful for data analysis from the simple classification of continuous values to implementing models that operate on rows of values (e.g., modelling the impact on sales as a function of own and competitor price changes) to cleansing data using the values of multiple columns.