dimfalk / kostra2010R

R interface for KOSTRA-DWD-2010R dataset
GNU General Public License v3.0

`vg250_gem_centroids.rda`: tidy data, remove duplicates #33

Closed dimfalk closed 1 year ago

dimfalk commented 1 year ago

`vg250_gem_centroids$GEN |> table() |> sort(decreasing = TRUE)` shows many non-distinct names, e.g. "Neuenkirchen" appears 12 times. Multipolygons?
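A minimal sketch of that check on toy data, for reference. Only the `GEN` column name comes from this issue; the `centroids` data frame and its values are made up for illustration and stand in for `vg250_gem_centroids`.

```r
# Toy stand-in for vg250_gem_centroids (values are invented).
centroids <- data.frame(
  GEN = c("Neuenkirchen", "Kiel", "Neuenkirchen", "Bochum", "Kiel", "Neuenkirchen")
)

# Count occurrences per name, most frequent first.
name_counts <- centroids$GEN |> table() |> sort(decreasing = TRUE)

# Names that occur more than once.
dup_names <- names(name_counts[name_counts > 1])

# All rows sharing a non-unique name, for manual inspection.
dup_rows <- centroids[centroids$GEN %in% dup_names, , drop = FALSE]
```

Looking at `dup_rows` rather than only at the counts makes it easier to judge whether the repeats are separate places or parts of the same one.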

dimfalk commented 1 year ago

Apparently, "Neuenkirchen" is the most common municipality name in (northern) Germany, so filtering by name alone does not return a unique result in this case.

However, some municipalities, such as "Kiel", are also represented by multiple polygons in the dataset. Waterfront?
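The two cases (namesakes vs. one municipality stored as several objects) could be told apart roughly as sketched below. This assumes the data carries some unique municipality key; the `id` column here is hypothetical and not a confirmed column of `vg250_gem_centroids`, and the values are toy data.

```r
# Toy data: `id` is a hypothetical unique municipality key.
centroids <- data.frame(
  id  = c("A1", "A2", "A3", "K1", "K1"),
  GEN = c("Neuenkirchen", "Neuenkirchen", "Neuenkirchen", "Kiel", "Kiel")
)

# Number of rows and number of distinct keys per name.
rows_per_name <- tapply(centroids$id, centroids$GEN, length)
keys_per_name <- tapply(centroids$id, centroids$GEN, function(x) length(unique(x)))

# Several keys under one name: different municipalities sharing that name.
namesakes <- names(keys_per_name[keys_per_name > 1])

# More rows than keys: a single municipality stored as several objects.
multipart <- names(rows_per_name[rows_per_name > keys_per_name])
```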

dimfalk commented 1 year ago

VG250_GEM seems to include some sea areas. They cannot be distinguished via attributes, so a secondary dataset would be needed for filtering.

Alternative:

- VG250_GEM (municipalities as polygons): 11,123 objects
- VG250_PK (municipalities as points): 10,994 objects; seems to be cleaned for these areas.
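A rough sketch of using the point dataset as a filter for the polygon dataset, on toy data. It assumes both datasets share an identifier and that the surplus polygon records carry keys absent from the point dataset; the `key` column and all values are hypothetical, and if a sea area shares its key with the land polygon this approach would not catch it.

```r
# Toy stand-ins for VG250_GEM and VG250_PK; `key` is a hypothetical shared identifier.
gem <- data.frame(
  key = c("01", "02", "03", "S1"),
  GEN = c("Kiel", "Neuenkirchen", "Bochum", "Kiel")
)
pk <- data.frame(key = c("01", "02", "03"))

# Polygon records without a counterpart in the point dataset.
unmatched <- gem[!gem$key %in% pk$key, , drop = FALSE]

# Polygon records that do have a counterpart.
gem_clean <- gem[gem$key %in% pk$key, , drop = FALSE]

# Object counts as reported above: the polygon dataset has 129 more objects.
n_surplus <- 11123 - 10994
```

Using VG250_PK directly avoids the matching step altogether, which is probably the simpler route if the points are sufficient.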