bcgov / ols-geocoder

Physical Address Geocoder
Apache License 2.0
10 stars 6 forks source link

Missing addresses not found in rejected list #371

Open BK01 opened 7 months ago

BK01 commented 7 months ago

Scenario: *Geocoded in PROD with October 2023 data vintage.

Searching for the following addressString (50 Clark Dr, Port Alice, BC) results in a BLOCK level match with a fault of STREET_NAME.spelledWrong. The resulting fullAddress includes an updated street name of Clarke Dr.

Task: Investigate why the addresses on Clark Dr were not found in the rejected address SHPs.

Initial Review:

It was found that BCA, AddressBC and the street sign shown in Google Street View use the street name ‘Clark’. BCA's assessment search tool also lists ’50 CLARK DR PORT ALICE’ with a PID. The PID was also confirmed in the PMBC and BCA input files. This street name discrepancy has been submitted to the road network team for review.

Based on the street name difference, it was no surprise that 50 Clark Dr, Port Alice, BC was not listed in the site_Hybrid_geocoder file. However, it was also not listed in the rejected address SHPs.

Next, the PMBC layer was examined and showed that this location is a strata lot. More specifically, there were 102 parcel polygons stacked at this location. 101 parcels had a unique PID and an owner type of Private. One parcel was unclassified and had no PID.

Performing a within search (in PROD) of this parcel revealed 69 sites at -127.4830565, 50.4290903. Based on the site_Hybrid_geocoder file they each have a unique PID.

Regarding the address range, most addresses appear to be within the correct overall range. However, the road network shows that part of Clarke Dr only has addresses on the left side of the street (right side range is 0 - 0), while having addresses on both sides of the street in another section. This corresponds with the residential driveways seen in aerial photos and Google Street View.

Finally, as a test the source addresses from AddressBC found within the associated parcel were Batch Geocoded. Of the 99 addresses, 100% of the non-Clark Dr addresses (67x) were found with a matchPrecision of CIVIC_NUMBER. The Clark Dr addresses only matched to the BLOCK level (30x) and the STREET level (2x - CIVIC_NUMBER.notInAnyBlock).

gleeming commented 7 months ago

Some notes to pass on from my investigation

TBD

BK01 commented 7 months ago

Thank you for looking into this further and providing notes on the remaining steps. By searching the contents of site_loader_prep_rejected.csv I can confirm that the addresses on Clark Drive are listed with a fault of STREET_NAME.spelledWrong.