data-derp / small-exercises

6 stars 9 forks source link

[Content] [Wrangling in Spark] (02 - 06: 11: Partitioning and BucketingTimestamps and Math) #25

Closed kelseymok closed 1 year ago

kelseymok commented 1 year ago

"In cell no 23 there's an error data, doesn't contain EventId -> rename the data variable to plane_data Full line should be: plane_data_selected = plane_data.select(*[col for col in plane_data.columns if col not in ['year']], plane_data.year.cast('integer'))"