NYCPlanning / data-engineering-qaqc

streamlit app for data engineering
https://edm-data-engineering.nycplanningdigital.com
1 stars 0 forks source link

PLUTO outlier report for bldgarea/lotarea > numfloors*2 #205

Open lseirup opened 2 years ago

lseirup commented 2 years ago

This query should be changed to "(BldgArea/NumFloors) < 100 and BldgArea <> 0 and NumFloors >= 10"

The goal is to find egregious errors in NumFloors. The query values are somewhat arbitrary, finding buildings that have less than 10 sq ft per floor and at least 10 floors tall. However, it finds 5 outliers in 22v2 with no false positives. Manual corrections will be added for these lots.

Text box should read "This table displays BBLs with a value for NumFloors that is not feasible given the BldgArea. The query used is (BldgArea/NumFloors) < 100 and BldgArea <> 0 and NumFloors >= 10."