MattTriano / analytics_data_where_house

An analytics engineering sandbox focusing on real estates prices in Cook County, IL
https://docs.analytics-data-where-house.dev/
GNU Affero General Public License v3.0
9 stars 0 forks source link

Refactor _standardized stage scripts to clean col-values before making them into a composite key #121

Closed MattTriano closed 1 year ago

MattTriano commented 1 year ago

The NYC Property Sales data set includes commas in the sale prices, and the best composite key I could find included the sale price. At present, I form a composite key before cleaning the sale_price, which will make it hard to remake the composite key. It would be much better to just apply cleaning before making the composite key.