datamade / usaddress

:us: a python library for parsing unstructured United States address strings into address components
https://parserator.datamade.us/usaddress
MIT License
1.5k stars 302 forks source link

ERROR: Unable to tag this string because more than one area of the string has the same label #341

Open maning opened 1 year ago

maning commented 1 year ago

ERROR: Unable to tag this string because more than one area of the string has the same label

ORIGINAL STRING: 2105 New Jersey Highway 35, Middletown, New Jersey 07748, USA PARSED TOKENS: [('2105', 'AddressNumber'), ('New', 'StreetName'), ('Jersey', 'StreetNamePostType'), ('Highway', 'StreetNamePreType'), ('35,', 'StreetName'), ('Middletown,', 'PlaceName'), ('New', 'StateName'), ('Jersey', 'StateName'), ('07748,', 'ZipCode'), ('USA', 'CountryName')] UNCERTAIN LABEL: StreetName

When this error is raised, it's likely that either (1) the string is not a valid person/corporation name or (2) some tokens were labeled incorrectly

To report an error in labeling a valid name, open an issue at https://github.com/datamade/usaddress/issues/new - it'll help us continue to improve probablepeople!

For more information, see the documentation at https://usaddress.readthedocs.io/