ward-wise / data-analysis

Data analysis on Chicago infrastructure and infrastructure spending
MIT License
3 stars 7 forks source link

improve geocoding process flow and modularity #46

Open kollerbud opened 1 week ago

kollerbud commented 1 week ago

current workflow to add new addresses format processor is a bit convoluted. Example, changes need to be made to process "W BRAYTON ST & S STATE ST (0 E) & W 126 PL (120 W)", in order to geocode:

  1. location_format_processing.py: potentially new Regex pattern, then add new type to "LocationFormat", new function to break down to string to individual streets
  2. location_geocoding.py: update LocationGeocoder to accept the new LocationFormat type, then function to output (potentially) new Geometry
  3. geocoder_api/geocoder_local: also output Geometry shape

Idea: consolidate responsibilities between scripts, consistent with data types in location_structures.py: