snorkel-team / snorkel

A system for quickly generating training data with weak supervision
https://snorkel.org
Apache License 2.0
5.81k stars 857 forks source link

Make snorkel installable as a package #920

Closed dhimmel closed 6 years ago

dhimmel commented 6 years ago

It would be make snorkel more portable if it were installable as a package. I'm happy to try to tackle this today and tomorrow (while we're at Stanford Info Lab). The plan would be to create a setup.py so snorkel could be installed as a package in a versioned way. This hopefully would reduce the need to set environment variables, which has been causing us headaches.

Related content:

@ajratner and @stephenbach: do you know of any reasons that prevent using snorkel as a package rather than submodule?

lukehsiao commented 6 years ago

@dhimmel If you decide to pursue this, you can also refer to Fonduer. We recently made Fonduer pip-installable. Some other things you'll need to watch out for:

I'm sure there's more, but at least those are some things to consider.

dhimmel commented 6 years ago

Great thanks @lukehsiao. I've made some good progress in https://github.com/HazyResearch/snorkel/pull/921. I actually didn't see your comment above until I was deep into that PR --- otherwise I would have seen if we could have cherry picked your fonduer commits.

Anyways, are in the Info Lab at Stanford. If so, want to stop by the computer lab to chat?

stephenbach commented 6 years ago

Fixed in version 0.7. Thanks @dhimmel!