Open spyysalo opened 11 years ago
Hi Sampo, thanks for your help. You are correct, Gimli fails because it does not find the MEDLINE identifier. However, it should not fail when no MEDLINE identifier is provided, since that format is also valid.
That problem will be corrected on the next version of Gimli (milestone 1.0.3).
Best regards, David
@davidcampos: Good to know! If I may make an additional suggestion, it would be helpful for applying the tagger to resources other than PubMed if the matching of ID lines was relaxed from requiring "MEDLINE" to something more generic, like ^###[^ ]+$
, to allow e.g. ###PMC:1234567-sec-01-Introduction
.
@spyysalo Thanks for the suggestion. Gimli will allow that on next version.
Hello,
I also have the similar problems in running Gimli. I use Cygdrive of Windows 7 to provide Linux-like environment. Running the queries mvn -v
and javac -version
are normal.
Follow the tutorials of http://bioinformatics.ua.pt/support/gimli/doc/index.html
When I input below query
./gimli.sh convert JNLPBA -c Genia4ERtask2.iob2 -e protein -g gdep.gz -o corpus.gz
It output the error that can not find or import pt.ua.tm.gimli.reader.JNLPBAReader
What is the problems?
Thank you
When running
./gimli.sh annotate JNLPBA -c JNLPBA-test.gz -o output.txt -m protein.gz,protein,fw,jnlpba_protein.config
after creating
JNLPBA-test.gz
andprotein.gz
from the JNLPBA data distribution filesGenia4ERtask1.iob2
andGenia4EReval1.raw
(resp.), gimli 1.0.2 crashes oni.e. last line of
If
Genia4ERtask2.iob2
andGenia4EReval2.raw
are used instead ofGenia4ERtask1.iob2
andGenia4EReval1.raw
, the system works as expected, further indicating that the issue is the absence of the ID lines of the form###MEDLINE:95385995
. (didn't check the effect of switching out just one.)At a minimum, the system should fail gracefully if medline IDs are not included, and preferably work normally without them. This is a valid variant of the JNLPBA format, and comparable ids are not available for all inputs.