petermr / CEVOpen

Contentmining of Open phytochemical literature for medicinal activities

Create processing dictionary #37

Open petermr opened 5 years ago

petermr commented 5 years ago

This dictionary will be small but will contain the methods used to process the plant and extract the oils:

Thyme was harvested during the >flowering season<

5080681 The aerial plant parts (leaves, stems and flowers) were collected during its >flowering time<
5132230 >dried< and >crushed< leaves
5203915 Hairy roots (HR) and the roots of >soil-grown< plants (SGR)
5237462 The leaves were treated (>washed< and >dried<)
5248495 from the seeds sown in the greenhouse, with subsequent transplantation of the seedlings to the same field, in the Kotayk Region of Armenia, where they have been growing side by side, at an elevation of 1600 m above the sea level. Plant materials were collected during >blossoming period<
5282690 A. campestris L. was collected at >flowering stage< in September 2012
5307246 >Ripe< fruits of L. kerstingii
5307902 The >fresh< leaves of P. amboinicus were >extracted by steam<
5324201 Whole plants
5330108 Leaves are >washed< thoroughly, >dried< in shade, and >powdered<
5344628 >dried< floral buds
5364420 >dried< C. rotundus rhizomes
5393100 Extraction of the fruits was performed using >boiling water<
5397855 
5411863 All samples were collected at full >flowering stage< for species identification and fruit maturing stage for essential oil analyses
5412227 Flowering, aerial parts of >wild< Dracocephalum kotschyi Boiss
5423258 the >fresh< aerial parts
5427463 Leaves of S. officinalis L.
5448358 aerial parts (stems, leaves, and flowering tops) and the roots
5454990 from leaves, the branches
5485486 The leaves
5486035 ( v/ w >fresh< material)
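
The `>term<` markers in the snippets above can be pulled out mechanically to seed the dictionary. A minimal sketch in Python, assuming each line is an article ID followed by the snippet and that markers are never nested; `extract_marked_terms` is a hypothetical helper, not something that exists in CEVOpen:

```python
import re
from collections import Counter

# A term is whatever sits between '>' and '<'; surrounding spaces are trimmed.
MARKED = re.compile(r">\s*([^<>]+?)\s*<")

def extract_marked_terms(lines):
    """Count the marked processing terms across annotated snippet lines."""
    counts = Counter()
    for line in lines:
        for term in MARKED.findall(line):
            counts[term.lower()] += 1
    return counts

# Three of the snippets listed above, copied as they appear in this issue.
snippets = [
    "5132230 >dried< and >crushed< leaves",
    "5237462 The leaves were treated (>washed< and >dried<)",
    "5080681 collected during its> flowering time<",
]
terms = extract_marked_terms(snippets)
print(terms.most_common())
```

Counting occurrences rather than just collecting a set makes it easier to see which terms (here `dried`) recur across articles and so deserve a dictionary entry first.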
petermr commented 5 years ago

Have created a small dictionary of processing terms

https://github.com/petermr/CEVOpen/tree/master/dictionary/process

This should include

ambarishK commented 5 years ago

Sir, one more item can be included: isolation of essential oils. For example: "The EO was obtained by hydrodistillation."

I am processing the rest.

petermr commented 5 years ago

Quite right. I just took the terms from your 25-article subset. Please add additional terms in the "raw" directory. Also try to look them up in Wikidata (and add the description found there).
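
One possible way to do the Wikidata lookup is the public `wbsearchentities` action of the Wikidata API. The sketch below is only an illustration: the helper names and the User-Agent string are made up for this example, and a person still has to choose the correct hit, since the first result is often not the intended sense of the term.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

API = "https://www.wikidata.org/w/api.php"

def search_url(term, language="en", limit=5):
    """Build a wbsearchentities query URL for one dictionary term."""
    params = {
        "action": "wbsearchentities",
        "search": term,
        "language": language,
        "limit": limit,
        "format": "json",
    }
    return API + "?" + urlencode(params)

def candidates(response):
    """Reduce a wbsearchentities response to (id, label, description) tuples."""
    return [
        (hit["id"], hit.get("label", ""), hit.get("description", ""))
        for hit in response.get("search", [])
    ]

def lookup(term):
    """Query Wikidata over the network and return the candidate items."""
    # Hypothetical User-Agent; replace it with one that identifies your script.
    request = Request(search_url(term), headers={"User-Agent": "process-dict-sketch/0.1"})
    with urlopen(request) as reply:
        return candidates(json.load(reply))
```

Calling `lookup("hydrodistillation")` returns a list of candidates whose ID and description can then be copied into the dictionary by hand.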

-- Peter Murray-Rust Founder ContentMine.org and Reader Emeritus in Molecular Informatics Dept. Of Chemistry, University of Cambridge, CB2 1EW, UK

ambarishK commented 5 years ago

Yes sir.

petermr commented 5 years ago

Please go through oil186 and add more processing terms. Also you will find some more terms from E1.0.

ambarishK commented 5 years ago

Sir, please go through the process dictionary file - process20191014.xml

Script to prepare process dictionary - process20191014.sh

Total records - 17.

Collecting more terms from E1.0
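
The contents of process20191014.xml and process20191014.sh are not shown in this thread. As a rough sketch of how a term list could be turned into a dictionary file, assuming a layout of one `<dictionary>` element holding one `<entry>` per term (the element and attribute names here are assumptions, not taken from the actual file):

```python
import xml.etree.ElementTree as ET

def build_dictionary(title, terms):
    """Return a <dictionary> element with one <entry> per term (assumed layout)."""
    root = ET.Element("dictionary", {"title": title})
    for term in terms:
        # 'term' is the lower-cased search form, 'name' the display form (assumed).
        ET.SubElement(root, "entry", {"term": term.lower(), "name": term})
    return root

root = build_dictionary("process", ["dried", "crushed", "washed", "hydrodistillation"])
print(ET.tostring(root, encoding="unicode"))
```

Building the tree with `ElementTree` rather than string concatenation means characters such as `&` or `<` in a term are escaped automatically.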

ambarishK commented 5 years ago

Sir, please check the list of process terms extracted from E1.0

processoil120191015.tsv

Total count - 112.

Please clean the terms.
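
Part of the cleaning can be automated before a manual pass. A minimal sketch, assuming "clean" here means normalising case and whitespace, stripping stray punctuation and leftover `>`/`<` markers, and dropping duplicates; `clean_terms` is a hypothetical helper and the sample list is made up for illustration:

```python
import re

def clean_terms(raw_terms):
    """Normalise case and whitespace, strip stray punctuation, drop duplicates."""
    seen = set()
    cleaned = []
    for raw in raw_terms:
        # Collapse runs of whitespace, then trim punctuation from both ends.
        term = re.sub(r"\s+", " ", raw).strip(" \t.,;:()<>").lower()
        if term and term not in seen:
            seen.add(term)
            cleaned.append(term)
    return cleaned

raw = ["Dried", " dried ", "flowering  stage", ">washed<", "", "steam distillation."]
print(clean_terms(raw))
```

This keeps the first occurrence of each term in its original order. Spelling mistakes and non-process terms still need a human check.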

ambarishK commented 5 years ago

Sir, please check the updated sheet of extracted process terms - processoil18620191015.tsv

I have fixed all typos and added a Wikidata ID column.

Dictionary for process terms is processoil18620191018.xml
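
A quick check of the sheet can flag rows that still lack a usable Wikidata ID. The sketch below assumes a tab-separated file with header columns named `term` and `wikidata_id`; the real column names in the TSV may differ, and the IDs in the sample are placeholders, not real Wikidata items.

```python
import csv
import io
import re

# Wikidata item IDs are the letter Q followed by a positive integer.
QID = re.compile(r"Q[1-9]\d*")

def rows_needing_review(tsv_text, id_column="wikidata_id"):
    """Return the terms whose Wikidata ID cell is empty or not of the form Q123."""
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    return [
        row["term"]
        for row in reader
        if not QID.fullmatch((row.get(id_column) or "").strip())
    ]

# Placeholder data: Q111 and q222 are illustrative, not looked-up identifiers.
sample = "term\twikidata_id\ndried\tQ111\nbudding\t\ncrushed\tq222\n"
print(rows_needing_review(sample))
```

A well-formed ID only shows that the cell is filled in; whether it points at the right concept still has to be checked against the Wikidata description.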

petermr commented 5 years ago

On Fri, Oct 18, 2019 at 9:51 AM Ambarish Kumar notifications@github.com wrote:

Sir, please check for the updated sheet for extracted process terms - processE1.020191015.tsv https://github.com/petermr/CEVOpen/blob/master/dictionary/process/processE1.020191015.tsv

Please can you add the 'description' field

I have fixed all typos and added a Wikidata ID column.

Dictionary for process terms is processE1.020191018.xml https://github.com/petermr/CEVOpen/blob/master/dictionary/process/processE1.020191018.xml

Thank you, I will check this.


ambarishK commented 5 years ago

Sir, I have updated the sheet of extracted process terms - processoil18620191015.tsv - and added a description column.

petermr commented 5 years ago

budding: form of asexual reproduction // no Wikidata ID
female plants: article // rubbish

On Fri, Oct 18, 2019 at 10:40 AM Ambarish Kumar notifications@github.com wrote:

Sir, I have updated the sheet for extracted process terms - processE1.020191015.tsv https://github.com/petermr/CEVOpen/blob/master/dictionary/process/processE1.020191015.tsv and added description column.


ambarishK commented 5 years ago

Sir, I have made all the changes.