FRosner / spawncamping-dds

Data-Driven Spark allows quick data exploration based on Apache Spark.
Other
28 stars 15 forks source link

Replace aggregators by UDAFs where possible #292

Open FRosner opened 8 years ago

FRosner commented 8 years ago

Description

If we do aggregations on data frames, we should avoid using aggregators that require an RDD (df.rdd.aggregate(...)). If possible, we should replace all these patterns by UDAFs, which will simplify the code a lot because we don't have to work with Row objects anymore.

Prerequisites