JohnGiorgi / seq2rel-ds

This is a companion repository to seq2rel (https://github.com/JohnGiorgi/seq2rel) which aims to make it easy to generate training data.
5 stars 1 forks source link

Add back scispacy support #8

Closed JohnGiorgi closed 2 years ago

JohnGiorgi commented 3 years ago

Sometimes, PubTator misses annotations that cause an alignment to fail. Ideally, we could extend PubTators annotation using another service, like scispacy. Assuming that scispacy at least occasionally catches annotations that are missed by PubTator, this would lead to less missed alignments and therefore more training data. It would also likely improve the quality of the alignments, as it should lead to less missed interactions and coreferent mentions.

JohnGiorgi commented 2 years ago

Closing as I am not longer working on distant supervision with BioGRID.