petermr / tigr2ess

Materials for TIGR2ESS workshop in Delhi Feb 2019 - joint UK(Cambridge) - India project on Food Security.

`ami-search-cooccur` - Minor inconsistency in commonest.dataTables.html #14

Closed ay-amityadav closed 5 years ago

ay-amityadav commented 5 years ago

OS: Ubuntu 16.04

Step 1: Using getpapers, 100 research articles in XML format were downloaded for the query 'millets' and stored under the millets_xml directory.

Step 2: ami-search-cooccur was run to perform search and co-occurrence analysis on the millets_xml project with the dictionaries for species, gene, plantparts, drugs and monoterpene. The exact command entered at the terminal is:

ami-search-cooccur millets_xml/ species gene plantparts drugs monoterpene

One of the files generated by the above command is commonest.dataTables.html, which looks as follows in a web browser.

[Screenshot: commonest_datatables]

If we look at the header row of the table, we find some inconsistencies:

  1. The column named gene:human should probably be something like dic:gene. I am not sure how to interpret the word "human" here.
  2. Also, the column named species:binomial could be renamed to dic:species to match our input dictionary name.

petermr commented 5 years ago

Many thanks. "gene" and "species" are not dictionaries but hardcoded lexical analyzers. This must be documented better.
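
To illustrate the distinction, here is a minimal Python sketch of how the command-line names from this report might map to column headers. It is not part of ami: the helper `expected_header` and the table `BUILTIN_ANALYZER_HEADERS` are hypothetical names, the two built-in headers are simply the ones reported above, and the `dic:` prefix for dictionary-backed columns is an assumption inferred from the naming suggested in this issue.

```python
# Hypothetical illustration only; none of these names come from ami itself.
# "gene" and "species" are described in this thread as hardcoded lexical
# analyzers, so they map to the headers reported in commonest.dataTables.html.
BUILTIN_ANALYZER_HEADERS = {
    "gene": "gene:human",
    "species": "species:binomial",
}


def expected_header(name: str) -> str:
    """Return the column header one might expect for a command-line name."""
    if name in BUILTIN_ANALYZER_HEADERS:
        return BUILTIN_ANALYZER_HEADERS[name]
    # Assumption: columns backed by a real dictionary use a "dic:" prefix.
    return f"dic:{name}"


# The names passed to ami-search-cooccur in the original report.
args = ["species", "gene", "plantparts", "drugs", "monoterpene"]
headers = [expected_header(a) for a in args]
print(headers)
```

Read this way, the header row is consistent once "gene" and "species" are understood as analyzers rather than dictionaries, which is the point the documentation should make explicit.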

-- Peter Murray-Rust, Founder of ContentMine.org and Reader Emeritus in Molecular Informatics, Dept. of Chemistry, University of Cambridge, CB2 1EW, UK

vinitamehlawat commented 5 years ago

Yes sir, I will revise the documentation to explain this better. Thank you.