Open rsingh2083 opened 7 years ago
Hey @rsingh2083,
Thanks for filing this! That's a real doozy of an address. I haven't been able to figure out what it's referring to.
If you can confirm that this is a valid address pattern, we'd be happy to bring it in as training data. We'll need 4-5 more examples of the pattern to be able to train the model reliably.
I get similar error for this address : 9234 N Loop 1604 W San Antonio TX 78249
Hey @gl-ronak,
Can you tell me how you were expecting that address to be parsed? In particular, what does the second set of numerics (1604
) refer to?
If you can find 3-4 more examples of this pattern, we'd be glad to bring it in as training data.
I just experienced a similar issue.
usaddress.RepeatedLabelError:
ERROR: Unable to tag this string because more than one area of the string has the same label
ORIGINAL STRING: 1407 7 Ave NW, Calgary, AB T2N 0Z3, Canada
PARSED TOKENS: [('1407', 'AddressNumber'), ('7', 'StreetName'), ('Ave', 'StreetNamePostType'), ('NW,', 'StreetNamePostDirectional'), ('Calgary,', 'PlaceName'), ('AB', 'StateName'), ('T2N', 'OccupancyIdentifier'), ('0Z3,', 'OccupancyIdentifier'), ('Canada', 'PlaceName')]
UNCERTAIN LABEL: PlaceName
I'm using this library to automate the parsing of data from Google Maps to input into a SF db of organizations we work with. I'm I think I see where the error occurred , Calgary,
, however it is a Canadian address so that could be normal?
@NoahCardoza I think in this case there are actually two things going on:
OccupancyIdentifier
, which is probably throwing off the tagging of Canada
and causing it to get tagged as a repeated PlaceName
Canada
string anywayI was able to get a slightly more sensible parse by removing Canada
from the end of the string:
>>> usaddress.tag('1407 7 Ave NW, Calgary, AB T2N 0Z3')
(OrderedDict([('AddressNumber', '1407'), ('StreetName', '7'), ('StreetNamePostType', 'Ave'), ('StreetNamePostDirectional', 'NW'), ('PlaceName', 'Calgary'), ('StateName', 'AB T2N'), ('ZipCode', '0Z3')]), 'Street Address')
Ah, that should probably be enough. We don't have many organizations in CA, however, what are your thoughts on https://github.com/datamade/usaddress/pull/254? I'm assuming you might not be merging it seeing as the name of this project is usaddress?
I don't expect we'll support Canadian addresses in the near future, but if you'd like to support them you might try training your own model using the supplemental training data in #254.
Hello,
I just encountered an error
ORIGINAL STRING: Bronx, New York City PARSED TOKENS: [('Bronx,', 'PlaceName'), ('New', 'StateName'), ('York', 'PlaceName'), ('City', 'PlaceName')] UNCERTAIN LABEL: PlaceName
Seems to be a valid place https://www.britannica.com/place/Bronx-borough-New-York-City
While tagging this
usaddress.tag('Mr. Robbie Thomson,Cal. Hosp 2,Street 11, Block H,Jersey, New Jersey 121889,United States')
Im getting this error : -