Open imarios opened 7 years ago
Adding myself as a placeholder. I would love to get help if anyone is interested.
I'd like to give this a go, if no one else has already started on it.
Hi @jacobBaumbach! Sorry for the delay. We would love your help, so feel free to ask if you need anything. Best!
Currently we only support UDFs. UDAFs are really useful, and it would be nice to support them as well.
Relevant Scala docs and this. The second seems to be the preferred way to do this with Datasets.
The Aggregator:
Example from SO:
Examples from Spark:
Other example from Databricks:
TODO: Create a TypedAggregator that returns a TypedColumn.
The entire construct seems to be quite type safe. This could be really simple.
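For reference, here is a rough sketch of what the plain Spark side looks like today, using `org.apache.spark.sql.expressions.Aggregator` and its `toColumn` method, which already yields a `TypedColumn[IN, OUT]`. This is not the proposed `TypedAggregator` (that does not exist yet) and it uses no frameless API. `Sale`, `AvgBuffer`, and `AverageAmount` are made-up names for illustration only.

```scala
import org.apache.spark.sql.{Encoder, Encoders, SparkSession, TypedColumn}
import org.apache.spark.sql.expressions.Aggregator

// Hypothetical input record (illustration only, not part of any library).
final case class Sale(item: String, amount: Double)

// Hypothetical intermediate buffer: running sum and row count.
final case class AvgBuffer(sum: Double, count: Long)

// A typed UDAF: input Sale, buffer AvgBuffer, output Double.
object AverageAmount extends Aggregator[Sale, AvgBuffer, Double] {
  // Neutral element of the aggregation.
  def zero: AvgBuffer = AvgBuffer(0.0, 0L)

  // Fold one input row into the buffer.
  def reduce(b: AvgBuffer, s: Sale): AvgBuffer =
    AvgBuffer(b.sum + s.amount, b.count + 1)

  // Combine two partial buffers (e.g. from different partitions).
  def merge(a: AvgBuffer, b: AvgBuffer): AvgBuffer =
    AvgBuffer(a.sum + b.sum, a.count + b.count)

  // Turn the final buffer into the result; NaN for an empty input.
  def finish(b: AvgBuffer): Double =
    if (b.count == 0L) Double.NaN else b.sum / b.count

  def bufferEncoder: Encoder[AvgBuffer] = Encoders.product[AvgBuffer]
  def outputEncoder: Encoder[Double] = Encoders.scalaDouble
}

object AggregatorSketch {
  def main(args: Array[String]): Unit = {
    // Assumes a local Spark installation on the classpath.
    val spark = SparkSession.builder()
      .master("local[*]")
      .appName("aggregator-sketch")
      .getOrCreate()
    import spark.implicits._

    val sales = Seq(Sale("a", 10.0), Sale("b", 20.0), Sale("a", 30.0)).toDS()

    // toColumn carries both the input and the output type.
    val avg: TypedColumn[Sale, Double] = AverageAmount.toColumn.name("avg_amount")

    // Whole-dataset aggregation: Dataset[Double].
    val overall: Double = sales.select(avg).head()

    // Per-key aggregation: Dataset[(String, Double)].
    val perItem: Map[String, Double] =
      sales.groupByKey(_.item).agg(avg).collect().toMap

    println(s"overall=$overall perItem=$perItem")
    spark.stop()
  }
}
```

Since `toColumn` already fixes the input and output types, a frameless wrapper might only need to adapt this to its own typed column and encoder machinery, though that is an assumption about the eventual design rather than a settled plan.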