Closed dpprdan closed 7 years ago
Thanks @dpprdan well spotted. Both should now be fixed. Can you check ? If so I'll close the issue.
I just did a quick check and it looks fine to me now.
One thing I noticed only now is that quite a few variables have only one or two levels (when they were all factors) or a lot of NAs (now), e.g. gdp_year
, pop_year
, fips_10
, wikipedia
etc. gdp_year == 0
also does not make a lot of sense, does it? So is this data missing in the source or is it an import issue. And if it's something that cannot be fixed, it might be a good idea to drop these variables. Again, this might not be "as close as possible to NE" but what good are these variables if there is not any information in them? But maybe this is a new issue?
Thanks, the missing data in the cases I've looked at are present in the shapefiles from Natural Earth. You might want to raise that up with them. The problems with dropping variables are that the code becomes more complex, has to choose which to drop and would need to be changed if they are fixed. All make the code difficult to maintain.
Why are all attributes factor variables in
ne_countries
?I guess this might be related to "keep the data as close as possible to Natural Earth", but character and numeric make so much more sense, IMHO.
Also missings (-99, -099 (see
un_a3
)), should be coded asNA
, IMO.