gilienv / EssOilDB

Restructuring of Essential Oil Database
Apache License 2.0
8 stars 6 forks source link

create priority list of journals for machine extraction of profiles #45

Open petermr opened 5 years ago

petermr commented 5 years ago

List the most important journals for future development of extraction software Foreach give an indication of:

vinitamehlawat commented 5 years ago

Most Important journals for Future development of Extraction Software are:

Number of profile papers per year: ( approximate Number per year )

Here I used query ' Phytochemical Profile/Essential oil composition in Plants ' from ' 2017-2018 '

@petermr I am unable to understand this ' the ease of extracting profiles ' ?

petermr commented 5 years ago

Thanks, excellent

On Mon, Jun 24, 2019 at 11:26 AM vinitamehlawat notifications@github.com wrote:

PMR> "A" are repositories, possibly with transformed content "B" are journals "C" are bibliographic search engines

The priority is B > A >> C. Getting anything out of C requires a lot of scraping

Most Important journals for Future development of Extraction Software are:

  • EuropePMC A
  • PUBMED A
  • Fitotropia B
  • Journal of Essential Oil Research B
  • Journal of essential oil bearing plant B
  • Web of Science C
  • Google scholar C
  • SCOPUS C
  • Flavour & Fragnance Journal B
  • Phytochemistry Journal B
  • Natural Product Research B
  • Molecules B

Number of profile papers per year: ( approximate Number per year )

Here I used query ' Phytochemical Profile/Essential oil composition in Plants ' from ' 2017-2018 '

  • EuropePMC =1843
  • PUBMED=1600
  • Fitotropia// Unable to find Number for this journal
  • Journal of Essential Oil Research=3034
  • Journal of essential oil bearing plant =2300
  • Web of Science=3500
  • Google scholar=1800
  • SCOPUS=1200
  • Flavour & Fragnance Journal// Unable to find Number for this journal
  • Phytochemistry Journal// Unable to find Number for this journal
  • Natural Product Research=2700
  • Molecules Journal=4000

Are you sure of these figures? That "Molecules" has 4000 profiles per year? It seems high to me, but if so it's good

@petermr https://github.com/petermr I am unable to understand this ' the ease of extracting profiles ' ?

JEOR is very easy as all tables are exposed as CSV. The order of ease is roughly:

CSV or separate HTML Tables (easy) Tables in running HTML text (fairly easy) Tables in PDF (very hard)

I'd like to see some eamples of these on Thursday (was Wed), thanks!

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/gilienv/EssOilDB/issues/45?email_source=notifications&email_token=AAFTCS6BRDV4TDJBXSTC6OLP4COMXA5CNFSM4H2Q4TXKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGODYMPN2I#issuecomment-504952553, or mute the thread https://github.com/notifications/unsubscribe-auth/AAFTCSYFYMFOB5MB4R7JQ7TP4COMXANCNFSM4H2Q4TXA .

-- Peter Murray-Rust Founder ContentMine.org and Reader Emeritus in Molecular Informatics Dept. Of Chemistry, University of Cambridge, CB2 1EW, UK

vinitamehlawat commented 5 years ago

ohh it's My mistake Because In Molecule Journal when we search for any Query it showed results on Word basis, Not as Whole Query. It showed 378 Results for Phytochemical & 4790 for Profile .I didn't notice that REALLY sorry for that.

petermr commented 5 years ago

On Mon, Jun 24, 2019 at 2:31 PM vinitamehlawat notifications@github.com wrote:

ohh it's My mistake Because In Molecule Journal when we search for any Query it showed results on Word basis, Not as Whole Query. It showed 378 Results for Phytochemical & 4790 for Profile .I didn't notice that REALLY sorry for that.

No problem - this is a good lesson to learn. Current searches always have a LOT of false positives. You have to look in detail at what you have retrieved... in many cases only 1-10% of retrieved documents are relevant.

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/gilienv/EssOilDB/issues/45?email_source=notifications&email_token=AAFTCS4RAK4754PIMPZQZ2LP4DEEXA5CNFSM4H2Q4TXKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGODYM5XMA#issuecomment-505011120, or mute the thread https://github.com/notifications/unsubscribe-auth/AAFTCS62237RHESHWONWSKLP4DEEXANCNFSM4H2Q4TXA .

-- Peter Murray-Rust Founder ContentMine.org and Reader Emeritus in Molecular Informatics Dept. Of Chemistry, University of Cambridge, CB2 1EW, UK