Open dhimmel opened 7 years ago
@danich1, for guidance check out the following (private communication by @ajratner):
If interested, you can also see the chemical-disease relation task I was mentioning. We're still cleaning it up, adding explanatory comments, etc--it's not in tutorial form yet--but it's the most up-to-date application we currently have posted online: https://github.com/HazyResearch/snorkel/tree/cdr-tutorial/tutorials/cdr
Seems like a good task to me! Further discussion around (a) chemical vs. "compound" and (b) initial relation type on other threads
I met @thodrek (Theo Rekatsinas) at the Moore Early Career Symposium in Hawaii. He's part of HazyResearch and helped me understand snorkel. We also snorkeled in real life too and saw a :turtle:.
I learned that snorkel will extract sentences that either support a positive (+1) or negative (-1) relationship. However, we will have to do an additional step to "fuse sentences" into a single judgement or score on whether a given relationship exists. @thodrek's paper on SLiMFast is a good place to look for inspiration (blog post).
@thodrek, thanks for the help. Let us know whether you're interesting in contributing to this project and we can add you to the repo. Contributions could be giving feedback on GitHub issues, pull request review as requested, or anything else valuable. Cheers.
@danich1 is a new rotation student in the Greene Lab :champagne:. We were thinking a good rotation project would be to extract medical indications from the literature. The project is intended as a pilot. The ultimate goal is to automate the integration of all biological knowledge into a single hetnet.
We were thinking Compound–treats–Disease relationships were a good place to start for the following reasons:
Project Rephetio used Disease Ontology as its disease vocabulary and DrugBank as its drug vocabulary. But we're flexible here.
Paging @ajratner and @stephenbach -- creators of Snorkel.