microsoft / GlobalMLBuildingFootprints

Worldwide building footprints derived from satellite imagery
Other
1.35k stars 201 forks source link

Rename the data files in .gz files from .csv to .ndjson #61

Open chaitanyach opened 1 year ago

chaitanyach commented 1 year ago

The data is in new line delimited JSON format (https://gdal.org/drivers/vector/geojsonseq.html) GIS software can seamlessly read the files if they are named as .ndjson indicating that they are of that format. Since there is only one field per line and no header, this is not really a CSV format. At least mention in the docs that changing the extension helps in opening the data by GDAL supported software. There is no need to run a python script.

msmitherdc commented 1 year ago

this can be done in gdal without changing the extension. One can do

geojsonseq:/vsigzip//vsicurl/https://minedbuildings.blob.core.windows.net/global-buildings/2023-06-06/global-buildings.geojsonl/RegionName%3DKazakhstan/quadkey%3D121220301/part-00166-e6b13dc0-a501-4630-bc51-aa2e3483e114.c000.csv.gz

for example