Closed tomkdefra closed 7 months ago
@tomkdefra this may be happening if you're re-running the cell with the joins multiple times (you're recursively joining the dataframe onto itself).
The solution could be to run the cells from the point of the first definition of features_df
or to use different variables for each new definition of the dataframe, e.g. features_with_roads_df
Looks like all the dataframe joins are stacking up 'road_length' columns. Can we rename them or is there a cleverer way?
AnalysisException: [AMBIGUOUS_REFERENCE] Reference
road_length
is ambiguous, could be: [road_length
,road_length
,road_length
,road_length
].