-
## Adding a Dataset
- **Name:** *Genetic Association Database Corpus*
- **Description:** *a corpus identifying associations between genes and diseases by a semi-automatic annotation procedure based …
-
The simple alignment scoring we introduced in #6 does not catch missed coreferent mentions. This will lead to some less-than-perfect alignments with scores of 1.0. I don't currently have a proposal to…
-
Sometimes, PubTator misses annotations, which causes an alignment to fail. Ideally, we could extend PubTator's annotations using another service, like [scispacy](https://allenai.github.io/scispacy/). Assum…
-
I am finding a number of mismatches in the organism ID for the same PMID between BioGRID and PubTator. Take, for instance, https://pubmed.ncbi.nlm.nih.gov/10924150/, where PubTator correctly identifies…
-
The quick-start is a little too brief. Here's a sketch of a tutorial notebook (possibly split into multiple sections):
1. `simple_annotator` in action
2. under the hood: `SupervisableDataset`
3. …
-
In the training script, `DATASETS.TRAIN` is set to 20DS_VG_ CC / VG KB_train. However, the experimental setting says "during training distant supervision is performed using the intersection of relations from V…
-
Hi,
I’ve upgraded a platform from the latest 19.04 release to 20.04.13 (following this documentation: https://docs.centreon.com/20.04/fr/upgrade/upgrade-from-19-04.html). While I had no problem doing so on another…
-
Hi,
I have several questions regarding the implementation of distant supervision of the ranker:
1. Which variant of BM25 did you use? Would be great if you could provide a pointer to the package y…
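
To make the first question concrete: BM25 implementations differ mainly in the IDF formula and in the defaults for `k1` and `b`. Below is a minimal sketch of one common variant (Okapi BM25 with the non-negative IDF that Lucene uses). It is only an illustration of what "variant" means here, not a claim about what the authors used; the function name `bm25_scores` and the default values are assumptions.

```python
import math
from collections import Counter


def bm25_scores(query, docs, k1=1.2, b=0.75):
    """Score each tokenized document in `docs` against a tokenized `query`.

    Hypothetical helper for illustration only; k1 and b are assumed defaults.
    """
    n_docs = len(docs)
    avg_len = sum(len(d) for d in docs) / n_docs
    # Document frequency: number of documents that contain each term.
    df = Counter(term for d in docs for term in set(d))

    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term in query:
            if term not in tf:
                continue
            # Lucene-style IDF; the "1 +" keeps it non-negative.
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            # Term-frequency saturation with document-length normalization.
            norm = tf[term] + k1 * (1 - b + b * len(d) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


docs = [
    ["gene", "disease", "gene"],
    ["protein", "binding"],
    ["disease", "treatment"],
]
gene_scores = bm25_scores(["gene"], docs)
disease_scores = bm25_scores(["disease"], docs)
```

Other variants (for example the classic Robertson IDF, which can go negative, or BM25+) change only the `idf` or `norm` lines, which is why the exact package matters for reproducing results.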
-
**Dataset Information:**
We have extracted anchor text pointing to documents in MS MARCO (version 1 and version 2) from several Common Crawl snapshots that can be used as additional retrieval featu…
-
Hello, what do you use as the test set in the distant supervision scenario?