Closed CloudNiner closed 4 years ago
I had a miserable time emulating RDD.groupByKey in DataFrame. I'd be very curious if you have suggestions on how to avoid dropping to RDDs for this particular aggregation now that there's a working RDD implementation to compare against.
I had a miserable time emulating RDD.groupByKey in DataFrame. I'd be very curious if you have suggestions on how to avoid dropping to RDDs for this particular aggregation now that there's a working RDD implementation to compare against.