openvenues / libpostal

A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.
MIT License
4.08k stars 421 forks source link

Italian misspelled or missing tokens #665

Open opk12 opened 5 months ago

opk12 commented 5 months ago

Hi!

The OpenStreetMap Italia community was checking out libpostal, and saw something that could be improved.


My country is

Italy


Here's how I'm using libpostal

libpostal was recently discussed in the OpenStreetMap Italia chatroom. I am not familiar with the libpostal internals, but here are a misspelled token and some potentially missing tokens in the dictionaries, which you may want to check out.


Here's what I think could be improved

  1. resources/dictionaries/it/number.txt These forms are also common in Italian: n. - nr. - nr
  2. In resources/dictionaries/it/street_types.txt, 4.1. via (line 129) is commonly abbreviated as v. 4.2. via provinciale (line 133) is typo'ed as via provincale.