The groupBy() transformation on a DataFrame is described as returning a special GroupedData object. Is the dataDF object itself altered (changed into a GroupedData object), or do you have to assign the result of dataDF.groupBy() to another variable?
Additionally, all of the code examples below show the groupBy() function immediately followed by an aggregation function. Do these aggregations work similarly to actions, in that a transformation (like groupBy()) does not occur until an aggregation is performed?