ccao-data / data-architecture

Codebase for CCAO data infrastructure construction and management
https://ccao-data.github.io/data-architecture/
5 stars 3 forks source link

Add new model features to capture parcel shape #522

Open dfsnow opened 3 days ago

dfsnow commented 3 days ago

This PR updates the parcel file ETL to add 6 new features for modeling. These features attempt to capture the complexity of the parcel shape, based on the idea that more "regular" shaped parcels are more desirable and are thus worth more, they were mostly taken from this paper. The 6 features are:

  1. The standard deviation of the interior angles of the parcel polygon
  2. The standard deviation of the parcel polygon edge length
  3. The standard deviation of the parcel vertices' distance to the parcel centroid
  4. The total number of vertices in the parcel polygon
  5. The ratio of the length of the sides of the minimum spanning rotated rectangle
  6. The ratio of the areas of the minimum spanning rotated rectangle and the parcel polygon itself

I haven't actually run this script yet to replace any cleaned up prod parcel files, but I'll do so once this is merged.

I'll make a follow-up PR to add the output features to dbt and all the necessary views/docs.