datamade / usaddress

:us: a python library for parsing unstructured United States address strings into address components
https://parserator.datamade.us/usaddress
MIT License
1.52k stars 304 forks source link

ERROR: Unable to tag this string because more than one area of the string has the same label #336

Open amanhoa opened 2 years ago

amanhoa commented 2 years ago

ORIGINAL STRING: 11412 Via Rancho San Diego UNIT 45, El Cajon, CA 92019 PARSED TOKENS: [('11412', 'AddressNumber'), ('Via', 'StreetName'), ('Rancho', 'StreetName'), ('San', 'PlaceName'), ('Diego', 'PlaceName'), ('UNIT', 'OccupancyType'), ('45,', 'OccupancyIdentifier'), ('El', 'PlaceName'), ('Cajon,', 'PlaceName'), ('CA', 'StateName'), ('92019', 'ZipCode')] UNCERTAIN LABEL: PlaceName

When this error is raised, it's likely that either (1) the string is not a valid person/corporation name or (2) some tokens were labeled incorrectly

To report an error in labeling a valid name, open an issue at https://github.com/datamade/usaddress/issues/new - it'll help us continue to improve probablepeople!