opentargets / issues

Issue tracker for Open Targets Platform and Open Targets Genetics Portal
https://platform.opentargets.org https://genetics.opentargets.org
Apache License 2.0
12 stars 2 forks source link

Handle literature mining from preprints and patents #2879

Open ireneisdoomed opened 1 year ago

ireneisdoomed commented 1 year ago

We want to accommodate the new stream of findings we extract from preprints and patents through EPMC's mining pipeline, on top of the existing literature references.

Background

EPMC's new pipeline is submitting to us the results of running their entity recognition algorithm on pre print publications and patents submitted to the relevant national institutions such as the European Patent Office.

These are the numbers of evidence per data type that we will be incorporating in 23.02:

+----------+-------+
|  lit_type|  count|
+----------+-------+
|Literature|5131169|
| Preprints|  97951|
|   Patents|  94120|
+----------+-------+

Note: These additions not only affect EPMC's evidence, but the bibliography widget so that these are IDs will be now queryable.

Tasks

prashantuniyal02 commented 1 year ago

Adding few suggestions here: