jakelever / civicmine

Text mining cancer biomarkers for the CIVIC database
http://bionlp.bcgsc.ca/civicmine
MIT License

Annotated Sentences in CivicMine #3

Closed: iCHAIT closed 4 years ago

iCHAIT commented 4 years ago

Hi @jakelever,

I read the CivicMine paper and came across the following statement in the abstract:

To this end, a group of cancer genomics experts annotated biomarkers and their clinical associations discussed in 800 sentences and achieved good inter-annotator agreement

May I know if the corpus linked by @swartchris8 in #2 (https://github.com/jakelever/cancermine/blob/master/data/cancermine_corpus.zip) is the one that contains these 800 sentences? Were these sentences manually annotated by experts?

If not, could you point me to the corpus of these 800 manually annotated sentences?

jakelever commented 4 years ago

Hi, thanks for the question. I noticed some confusion in the README, so I fixed it and added a link to the corpus. The 800 sentences for CIViCmine can be found in the data/ directory of this repo.

iCHAIT commented 4 years ago

Thank you @jakelever 👍