geomarker-io / addr

Clean, Parse, Harmonize, Match, and Geocode Messy Real-World Addresses
https://geomarker.io/addr/
Other
2 stars 0 forks source link

geocoding improvements #22

Closed cole-brokamp closed 3 weeks ago

cole-brokamp commented 1 month ago
erikarasnick commented 1 month ago

2400 block of Colerain Ave was matched to the entire length of Colerain Ave. Upon further inspection, I don't think the 2400 block exists... the street is not continuous and jumps from 2100s to 2600s. When our desired outcome is a matched street range geography (as in xx_address) would we want a case like this to return missing? Or the nearest street range?

cole-brokamp commented 1 month ago

Returning the numerically nearest street range would be a good option to provide instead of the union when the street is matched but the ranges do not contain the street number.

edit: tracking this specific change in #29

erikarasnick commented 1 month ago

I noticed when using addr_match_tiger_street_ranges some addrs are not getting matched even when the street does exist in the TIGER street ranges.

e.g., addr TIGER street name
2250 Westwood Northern Boulevard Westwood Northern Blvd
1050 William H Taft Road William Howard Taft Rd
2650 Richwill Court Richwill Ct

The street suffixes are abbreviated in TIGER and not addr, but this doesn't cause a problem for lots of other addresses. Could this be related to the "number of edits" matching?

I also don't see any interstates in the TIGER street ranges. Are those getting filtered out?

cole-brokamp commented 1 month ago

Compare the address tags in those two cases. Maybe getting parsed incorrectly?

erikarasnick commented 1 month ago

yes, so this is related to #27. will track there.

except the Taft one is just the difference in "H" vs "Howard"

cole-brokamp commented 3 weeks ago

now that we have this broken up into more modular issues, closing tracking here.