lubianat / ann

A repository for brainstorming and prototyping ideas related to the eLifeSprint project Annotate them all (https://sprint.elifesciences.org/annotate-them-all/)
Apache License 2.0

Enhance Open Tapioca for biological concepts #17

Open lubianat opened 4 years ago

lubianat commented 4 years ago

What is your idea?

Open Tapioca is a nice entity matcher for Wikidata, but it only works for the names of people, organizations and places. It would be cool if we could use it for biological concepts!

What can we do at the Sprint?

Dig into the code of Open Tapioca and figure out a way of making the Natural Language Processing (NLP) algorithm detect genes (or diseases, or proteins).

This would be a cool "deliverable", because it is both relevant to this project and integrated with an external tool (immediate impact!)
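
To make the idea a bit more concrete, here is a rough sketch of one possible angle: widening the set of Wikidata classes that a matcher accepts. This is only an illustration, not Open Tapioca's actual code. The names `CURRENT_TYPES`, `BIO_TYPES` and `keep_item` are hypothetical, the items are toy data with placeholder IDs, and it assumes each item carries its "instance of" (P31) values directly. Places are left out of the toy allowlist for brevity, and a real version would probably also need to follow "subclass of" (P279) links.

```python
# Hypothetical sketch: none of these names come from the Open Tapioca codebase.

# Wikidata classes (values of "instance of", P31) a matcher might accept today.
CURRENT_TYPES = {
    "Q5",      # human
    "Q43229",  # organization
}

# Extra classes we would like to add for biological concepts.
BIO_TYPES = {
    "Q7187",   # gene
    "Q8054",   # protein
    "Q12136",  # disease
}


def keep_item(item, allowed_types):
    """Return True if the item has at least one allowed P31 value."""
    return any(t in allowed_types for t in item.get("P31", []))


# Toy items with placeholder IDs, not real Wikidata entries.
items = [
    {"id": "item-1", "label": "some person", "P31": ["Q5"]},
    {"id": "item-2", "label": "some gene", "P31": ["Q7187"]},
    {"id": "item-3", "label": "some disease", "P31": ["Q12136"]},
]

# Items kept with the current allowlist vs. the extended one.
before = [i["id"] for i in items if keep_item(i, CURRENT_TYPES)]
after = [i["id"] for i in items if keep_item(i, CURRENT_TYPES | BIO_TYPES)]

print(before)
print(after)
```

With the extended allowlist the gene and disease items pass the filter as well. Whether Open Tapioca can be extended this simply is exactly what we would need to find out by reading its code.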

What skills does it require?