datamade / usaddress

:us: a python library for parsing unstructured United States address strings into address components
https://parserator.datamade.us/usaddress
MIT License
1.52k stars 304 forks source link

North and South Dakota Not Recognized as Place Names #330

Open montep opened 2 years ago

montep commented 2 years ago

ERROR: Unable to tag this string because more than one area of the string has the same label

ORIGINAL STRING: 875 Bletchley Rue, Valetta, North Dakota 8342 PARSED TOKENS: [('875', 'AddressNumber'), ('Bletchley', 'StreetName'), ('Rue,', 'StreetNamePostType'), ('Valetta,', 'SubaddressType'), ('North', 'SubaddressIdentifier'), ('Dakota', 'SubaddressType'), ('8342', 'SubaddressIdentifier')] UNCERTAIN LABEL: SubaddressType

ERROR: Unable to tag this string because more than one area of the string has the same label

ORIGINAL STRING: 9323 Aberavon Drive, Berna, South Dakota 8434 PARSED TOKENS: [('9323', 'AddressNumber'), ('Aberavon', 'StreetName'), ('Drive,', 'StreetNamePostType'), ('Berna,', 'SubaddressType'), ('South', 'SubaddressIdentifier'), ('Dakota', 'SubaddressType'), ('8434', 'SubaddressIdentifier')] UNCERTAIN LABEL: SubaddressType