openvenues / libpostal

A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.
MIT License
4.08k stars 421 forks source link

Question: Should I expand_address before parsing? #642

Open piotratlz opened 1 year ago

piotratlz commented 1 year ago

Hi

As it was written in issue title I wonder if I it is recommended to expand address before running parse_address? My main goal is to identify unique addresses in dataset and it seems natural to first normalise an address and then parse it to extract labeled parts.

albarrentine commented 9 months ago

Usually the other way around, parse then expand each component with its specific dictionaries. See the https://github.com/openvenues/lieu project for some examples in Python