petermr / tigr2ess

Materials for TIGR2ESS workshop in Delhi Feb 2019 - joint UK(Cambridge) - India project on Food Security.
Other
4 stars 10 forks source link

"Test run of the ami-search-new for the new release of the ami-jars - ami20190218c." #57

Closed ambarishK closed 5 years ago

ambarishK commented 5 years ago

Status of test run of the ami-search-new for the new release of the ami-jars - ami20190218c is successful.

It finds all cooccurrences and extracts all bibliographic information.

Screenshots.

ambarish123@ubuntu:~$ ami-search-new -p ricenew/ --dictionary country gene

Generic values (AMISearchTool)
================================
basename            null
cproject            /home/ambarish123/ricenew
ctree               
cTreeList           [ricenew/PMC6331594, ricenew/PMC6337123, ricenew/PMC6339371, ricenew/PMC6342930, ricenew/PMC6343365, ricenew/PMC6343895, ricenew/PMC6345848, ricenew/PMC6351596, ricenew/PMC6352273, ricenew/PMC6357162]
dryrun              false
excludeBase         null
excludeTrees        null
file types          []
forceMake           false
includeBase         null
includeTrees        null
log4j               
logfile             null
verbose             0

Specific values (AMISearchTool)
================================
dictionaryList       [country, gene]
dictionaryTop        null
dictionarySuffix     [xml]
ignorePlugins        []

cProject: ricenew
cannot find dictionary: country
SEARCH running legacy processors
SEARCH running JSON bibliography

running: word; word([frequencies])[{xpath:@count>20}, {w.stopwords:pmcstop.txt stopwords.txt}]...
running: search; search([country])[]...
running: gene; gene([human])[]0    [main] DEBUG org.contentmine.ami.dictionary.gene.HGNCDictionary  - is /org/contentmine/ami/plugins/gene/hgnc/hgnc.xml
...
create data tables

Message related not finding dictionary persists.

cProject: ricenew
cannot find dictionary: country
SEARCH running legacy processors
SEARCH running JSON bibliography

Formed directories and files as a test run.

ambarish123@ubuntu:~/ricenew$ ls
commonest.dataTables.html     PMC6342930
__cooccurrence                PMC6343365
count.dataTables.html         PMC6343895
entries.dataTables.html       PMC6345848
eupmc_fulltext_html_urls.txt  PMC6351596
eupmc_results.json            PMC6352273
full.dataTables.html          PMC6357162
gene.human.count.xml          search.country.count.xml
gene.human.documents.xml      search.country.documents.xml
gene.human.snippets.xml       search.country.snippets.xml
PMC6331594                    word.frequencies.count.xml
PMC6337123                    word.frequencies.documents.xml
PMC6339371                    word.frequencies.snippets.xml

There is concatenated __ symbol with the formed cooccurrence result directory, which seems extraneous.

petermr commented 5 years ago

The __cooccurrence is a new name, because it's not confused with the CTrees.