microsoft / PlanetaryComputer

Issues, discussions, and information about the Microsoft Planetary Computer
https://planetarycomputer.microsoft.com/
MIT License
185 stars 9 forks source link

entire grids with missing buildings in building footprints #357

Open virgilxw opened 5 months ago

virgilxw commented 5 months ago

Hi - it seems there are large numbers of missing buildings from the Netherlands in the buildings-footprint dataset

catalog = pystac_client.Client.open(

<img width="470" alt="Screenshot 2024-06-05 at 08 51 15" src="https://github.com/microsoft/PlanetaryComputer/assets/14037656/36807ef2-d8da-4786-a13f-5caf4e4a3737">
  "https://planetarycomputer.microsoft.com/api/stac/v1",
      modifier=planetary_computer.sign_inplace,
  )
  collection = catalog.get_collection("ms-buildings")

  asset = collection.assets["delta"]

  storage_options = {
      "account_name": asset.extra_fields["table:storage_options"]["account_name"],
      "sas_token": asset.extra_fields["table:storage_options"]["credential"],
  }
  table = deltalake.DeltaTable(asset.href, storage_options=storage_options)

  quadkeys = [
      int(mercantile.quadkey(tile))
      for tile in mercantile.tiles(*buffer[0].bounds, zooms=9)
  ]
  quadkeys

  file_uris = table.file_uris([("RegionName", "=", "Netherlands")])
  df = pd.concat(
      [
          gpd.read_parquet(file_uri, storage_options=storage_options)
          for file_uri in file_uris
      ]
  )
777arc commented 2 weeks ago

@virgilxw apologies for the delay, before we dive into this issue is there any additional information you have found over the last few months?