Open pitkant opened 2 years ago
If this can be easily retrieved otherwise when needed (example, maybe?) then I guess it could also removed.
Addressed partly in v4-dev branch and PR #264. geo
column is now marked in get_eurostat_geospatial
function documentation as "Questioning", offering us some more time to discuss whether we should remove it or keep it in the future.
I will close this issue when v4 is released
Currently different geospatial datasets have the following columns:
Of these, at least in years 2016 and 2021, the following variables contain identical information:
id
,NUTS_ID
,FID
andgeo
. Theid
column is the unique identifier from geojson and not included in the csv file. Thegeo
column is generated at the end ofget_eurostat_geospatial
"for easier joins with dplyr", as well as in data generation scriptdata_spatial.R
.While some of this overlap is due to eurostat data itself containing duplicated columns, is
geo
column still necessary?