openvenues / libpostal

A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.
MIT License
4.04k stars 417 forks source link

Italia returns as country while not iso 3166 #644

Closed manycoding closed 6 months ago

manycoding commented 11 months ago

Hi!

I was checking out libpostal, and saw something that could be improved. E.g. for a simple address "Via Mazzacavallo, 8 43122 Parma PR Italia" lib postal returns country as Italia, but that's not 3166

I'd expect Italy.

albarrentine commented 7 months ago

libpostal is an NLP model that parses the input given. Cross-lingual normalization of country names/ISO 3166 code lookups are a somewhat easier database-oriented task and can be done e.g. with GeoNames, etc.