datamade / usaddress

:us: a python library for parsing unstructured United States address strings into address components
https://parserator.datamade.us/usaddress
MIT License
1.51k stars 303 forks source link

Incorrectly Parsing 'Rhode Island' from multiple valid addresses #304

Open dancorc27 opened 3 years ago

dancorc27 commented 3 years ago

Hi, I'm parsing multiple addresses in Rhode Island and am seeing the state name consistently parsed incorrectly as 'Island'. Some sample addresses:


180 Waterwheel Ln North Kingstown Rhode Island 02852 United States
510 Old Baptist Rd North Kingstown Rhode Island 02852 United States
50 Pleasant St North Kingstown Rhode Island 02852 United States

Example output for '180 Waterwheel Ln North Kingstown Rhode Island 02852 United States':

import usaddress

usaddress.tag('180 Waterwheel Ln North Kingstown Rhode Island 02852 United States')

Output;

(OrderedDict([('AddressNumber', '180'),
              ('StreetName', 'Waterwheel'),
              ('StreetNamePostType', 'Ln'),
              ('PlaceName', 'North Kingstown Rhode'),
              ('StateName', 'Island'),
              ('ZipCode', '02852'),
              ('CountryName', 'United States')]),
 'Street Address')

Thank you!