petermr / CEVOpen

Contentmining of Open phytochemical literature for medicinal activities
27 stars 19 forks source link

Manual Analysis of section content and titles #40

Open petermr opened 5 years ago

petermr commented 5 years ago

THIS ACTIVITY IS TO FIND THE SECTIONS IN WHICH KEY INFORMATION IS REPORTED. IT DOES NOT INCLUDE THE INTRODUCTION / BACKGROUND

each of these will be a separate column. There will usually be NO, ONE or possibly 2-4 entries.

ACTIVITY TESTED

We need the actual activities TESTED. There will normally be ONE, maybe TWO, occasionally MORE and sometime NONE Also please record the TITLE of the section where it is recorded. e.g. "analysis of antimicrobial activity"

PROCESS

Need the titles

LOCATION

Need the titles

PLANT

Need the title of sections which mention the plant/s under study

PLANTPARTS

Need the title of sections which mention the plant/s under study

ACTIVITY

Need the title of sections which mention the activity under study

The compounds will be in the table so not required and we can omit instrument at this stage

ambarishK commented 5 years ago

Sir, please check for the format of extraction and approach - oil186manualanalysis20191019.tsv

petermr commented 5 years ago

Example: PMC 5080681 (ALWAYS include "PMC")

Location: Methods >> Collecting and preparing plant materials Palestine

"Location" ONLY applies to where the material was collected, NOT where the equipment comes from

ambarishK commented 5 years ago

OK sir.

ambarishK commented 5 years ago

Sir, check for the updated sheet - oil186manualanalysis20191019.tsv

ambarishK commented 5 years ago

Sir, I have added 50 records to - oil186manualanalysis20191019.tsv. Please go through it.

petermr commented 5 years ago

Thanks I will do that early morning UK time. I have talked to @mannyrules and he will find the information useful. I now have the code to section the paper and will try to commit the results then

On Mon, 21 Oct 2019, 20:12 Ambarish Kumar, notifications@github.com wrote:

Sir, I have added 50 records to - oil186manualanalysis20191019.tsv https://github.com/petermr/CEVOpen/blob/master/project/articleAnalysis/oil186manualanalysis20191019.tsv. Please go through it.

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/petermr/CEVOpen/issues/40?email_source=notifications&email_token=AAFTCS77K22LX5HGMTO54N3QPX5IBA5CNFSM4JCQQRF2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEB3ONEI#issuecomment-544663185, or unsubscribe https://github.com/notifications/unsubscribe-auth/AAFTCS3QP6SAJ42QLF2T65DQPX5IBANCNFSM4JCQQRFQ .

ambarishK commented 5 years ago

Sir, please check for the updated sheet for manual analysis of oil186 - oil186manualanalysis20191019.tsv

This contains records for all 186 articles.

petermr commented 5 years ago

Please add PMCIDs to the first field of all non-empty rows in the table.

On Fri, Oct 25, 2019 at 9:06 AM Ambarish Kumar notifications@github.com wrote:

Sir, please check for the updated sheet for manual analysis of oil186 - oil186manualanalysis20191019.tsv https://github.com/petermr/CEVOpen/blob/master/project/articleAnalysis/oil186manualanalysis20191019.tsv

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/petermr/CEVOpen/issues/40?email_source=notifications&email_token=AAFTCS3VAC6HKGFUIH75RELQQKSJDA5CNFSM4JCQQRF2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOECHRYGA#issuecomment-546249752, or unsubscribe https://github.com/notifications/unsubscribe-auth/AAFTCS4UMM5ELZG6O2NX5FTQQKSJDANCNFSM4JCQQRFQ .

-- Peter Murray-Rust Founder ContentMine.org and Reader Emeritus in Molecular Informatics Dept. Of Chemistry, University of Cambridge, CB2 1EW, UK

ambarishK commented 5 years ago

Sir, please go through the updated sheet oil186manualanalysis20191019.tsv. I have removed empty rows and added PMCID in first column.

ambarishK commented 5 years ago

Sir, please go through the cleaned sheet for oil186ManualAnalysis20191019.tsv.

ambarishK commented 5 years ago

Sir, please confirm the format of extraction - activity, method paragraph title, results paragraph title.

oil186manualanalysis20191026.tsv

Column description -

Now moving onto extending the activity_table.csv

ambarishK commented 5 years ago

Sir, please go through the updates - oil186manualanalysis20191028.tsv