This PR swaps ScispaCy for PubTator for pipeline-based entity hinting. The change requires less code and fewer dependencies and is significantly faster, while performance should remain very similar.
Going to re-run all experiments and make sure results are as expected before merging.
Other changes
♻️ Drops the align and preprocess utils, moving their contents into common/util.py
Closes #34.