microsoft / RoadDetections

Road detections from Microsoft Maps aerial imagery

6,416,947 invalid linestrings in _USA.tsv #19

Open jmealo opened 2 months ago

jmealo commented 2 months ago

Hello,

A large number of linestrings in the _USA.tsv file are invalid: they either contain only a single point or are self-referencing.

Only about 88% of the rows in the US file are valid linestrings.

On top of that, ST_IsValid returns false for an additional 211,991 rows.
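For anyone who wants to pre-filter rows before loading them, here is a minimal Python sketch of the first kind of check described above. It assumes each row's geometry has already been parsed into a list of (x, y) coordinate pairs; the function name `classify_linestring` and its return labels are hypothetical, and "zero_length" is only one possible reading of "self-referencing". It is not a reimplementation of ST_IsValid.

```python
def classify_linestring(coords):
    """Classify a list of (x, y) pairs as a usable linestring or not.

    Returns one of three hypothetical labels:
      "too_few_points": fewer than two coordinate pairs
      "zero_length":    two or more pairs, but all at the same location
      "ok":             at least two distinct locations
    """
    if len(coords) < 2:
        return "too_few_points"
    # Only the first two ordinates are compared, so a Z value is ignored.
    distinct = {(pt[0], pt[1]) for pt in coords}
    if len(distinct) < 2:
        return "zero_length"
    return "ok"


if __name__ == "__main__":
    samples = [
        [(-75.1, 39.9)],
        [(-75.1, 39.9), (-75.1, 39.9)],
        [(-75.1, 39.9), (-75.2, 40.0)],
    ]
    for coords in samples:
        print(classify_linestring(coords), coords)
```

Rows labeled anything other than "ok" could be counted or skipped at load time; the remaining rows may still fail ST_IsValid for other reasons.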

Additionally, providing the file as one WKT representation per line would make it easier for many people to load into a database, and the file would be much smaller on disk.
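As a rough illustration of what one WKT line per row could look like, here is a small Python sketch. It assumes the coordinates are already available as (x, y) pairs; the helper name `to_wkt_linestring` is hypothetical and is not part of the dataset or any tooling in this repository.

```python
def to_wkt_linestring(coords):
    """Format a list of (x, y) pairs as a WKT LINESTRING string.

    Raises ValueError for fewer than two points, since a linestring
    needs at least two.
    """
    if len(coords) < 2:
        raise ValueError("a linestring needs at least two points")
    # repr() keeps the shortest text that round-trips each float exactly.
    body = ", ".join(f"{pt[0]!r} {pt[1]!r}" for pt in coords)
    return f"LINESTRING ({body})"


if __name__ == "__main__":
    print(to_wkt_linestring([(-75.1, 39.9), (-75.2, 40.0)]))
```

A file of such lines can be loaded as plain text and cast to geometry in the database, for example with ST_GeomFromText in PostGIS.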

Thanks, Jeff

jmealo commented 2 months ago

This is a duplicate of #11.