Open fgregg opened 7 years ago
Hey @fgregg, that would be an honor! Definitely a fan of @datamade's work around e.g. policing, housing, and racial justice (a set of topics very close to my heart).
No objections to the above from me. As to the specific points:
A pip-only installation would be amazing. I'd liken the way it currently works to, say, lxml requiring libxml2 to be installed separately. Since libpostal has many bindings, we don't bundle it with any of them, though in general it should be rare for people to use libpostal with multiple language bindings (I can see maybe Python + the Postgres extension). The main concern we have that most other libraries do not is the size of the model downloads. The global parser model currently takes up ~1.8GB of disk/memory, so it's often desirable to specify the location rather than rely on a potentially nonsensical default (the default datadir in Autotools is something like /usr/local/share, but in a setting like AWS, where root volumes are relatively small, it's often better to use a mounted EBS data volume). It should be fine for the pip install to trigger an install of libpostal, but it would still be preferable to have the Python binding play nice with a system installation if the user wants that, especially considering the hefty downloads, and being mindful that not all of our users have access to cheap broadband connections or unrestricted downloads.
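As a rough sketch of what "play nice with a system installation" could mean in practice, an installer might first look for an existing shared library and let the user override the model location. This is only an illustration under assumptions: the environment variable name `POSTAL_DATA_DIR_EXAMPLE`, the fallback path, and the helper functions are all hypothetical and are not part of libpostal or pypostal. Only `ctypes.util.find_library` is a real standard-library call.

```python
import os
from ctypes.util import find_library
from pathlib import Path

# Hypothetical names for illustration only; neither is defined by
# libpostal or pypostal.
DATA_DIR_ENV = "POSTAL_DATA_DIR_EXAMPLE"
# Assumed fallback, loosely modeled on the Autotools default datadir.
DEFAULT_DATA_DIR = Path("/usr/local/share/libpostal")


def find_system_libpostal():
    """Return the name of a system-wide libpostal shared library, or None."""
    return find_library("postal")


def resolve_data_dir(environ=None):
    """Prefer an explicit user override, otherwise use the assumed default."""
    environ = os.environ if environ is None else environ
    override = environ.get(DATA_DIR_ENV)
    return Path(override) if override else DEFAULT_DATA_DIR


def installation_plan(environ=None):
    """Summarize whether a system library can be reused and where models go."""
    library = find_system_libpostal()
    return {
        "use_system_library": library is not None,
        "library": library,
        "data_dir": resolve_data_dir(environ),
    }
```

With something like this, a user on AWS could point the override at a mounted data volume, and the ~1.8GB model download would only be triggered when no usable installation is found.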
It would definitely be possible to create Debian/Red Hat/Homebrew packages. We don't currently support Windows, but I've removed the problematic dependencies, and one user reported getting it working here: https://github.com/BenK10/libpostal_windows. Would ideally like to get compilation to work with Visual Studio and put up an Appveyor build if Windows support is to be a thing. I personally don't have any Windows machines available at the moment. In any case, if you guys are willing to help with packaging that would be awesome! I've been working mostly on core thus far and for installation have just assumed that a standard Autotools source build (configure/make/make install) is familiar enough to most *nix users.
Did anything ever happen with this? We are currently trying to decide between pypostal and usaddress. We need good Windows support, which seems to be a struggle with pypostal, but usaddress doesn't seem to have much activity (maybe a dying project?).
FYI I have done some work towards making pypostal installable just with pip over in #76. I am not sure what direction the project maintainers want to take this in, or if there even is energy for a major packaging change at this stage in the project's lifecycle.
Hi @thatdatabaseguy,
At @datamade, we are increasingly in need of a multinational version of usaddress. Now that libpostal has moved to a CRF model, it seems a little silly to not try to combine our efforts. Before we can do that we need
Are these things you would consider? @datamade would do the necessary work to make these happen.