While running the AMR pipeline, `ToAMConll` crashed on the training data because of a problem in the CoreNLP-based handling of named entities.
This is the output and the error message:
```
0
1000
[crickets]
[<, crickets, >]
Exception in thread "main" java.lang.IndexOutOfBoundsException: Index: 1, Size: 1
	at java.util.ArrayList.rangeCheck(ArrayList.java:657)
	at java.util.ArrayList.get(ArrayList.java:433)
	at de.saar.coli.amrtagging.formalisms.amr.tools.preproc.StanfordNamedEntityRecognizer.tag(StanfordNamedEntityRecognizer.java:40)
	at de.saar.coli.amrtagging.formalisms.amr.tools.ToAMConll.main(ToAMConll.java:198)
```
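There's indeed a sentence that is `< crickets >`, which might be a problem for code that does `replaceAll("[<>]", "")`: after that replacement, the tokens `<` and `>` become empty strings, and only `crickets` is left, which would match the `Size: 1` in the exception.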
@alexanderkoller, since you wrote the code: is that an easy fix for you?
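
To illustrate what I suspect is going on, here is a minimal sketch that does not use CoreNLP at all. `fakeTag` is a hypothetical stand-in for the tagger and simply assumes that empty tokens produce no tag. I have not checked that this is what `StanfordNamedEntityRecognizer.tag` actually does, and `stripBracketsSafely` with its `-` placeholder is only one possible workaround, not a proposal for the final fix.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class BracketTokenSketch {

    // Hypothetical stand-in for the tagger: ASSUMES that empty tokens
    // are dropped, so they produce no tag. Not the real CoreNLP call.
    static List<String> fakeTag(List<String> tokens) {
        List<String> tags = new ArrayList<>();
        for (String tok : tokens) {
            if (!tok.isEmpty()) {
                tags.add("O");
            }
        }
        return tags;
    }

    // The suspected preprocessing: remove every '<' and '>' from each token.
    static List<String> stripBrackets(List<String> tokens) {
        List<String> out = new ArrayList<>();
        for (String tok : tokens) {
            out.add(tok.replaceAll("[<>]", ""));
        }
        return out;
    }

    // Possible workaround (an assumption, not the project's code): keep a
    // placeholder whenever stripping would leave an empty token, so that
    // tokens and tags stay aligned.
    static List<String> stripBracketsSafely(List<String> tokens) {
        List<String> out = new ArrayList<>();
        for (String tok : tokens) {
            String cleaned = tok.replaceAll("[<>]", "");
            out.add(cleaned.isEmpty() ? "-" : cleaned);
        }
        return out;
    }

    public static void main(String[] args) {
        List<String> sentence = Arrays.asList("<", "crickets", ">");

        // Three tokens go in, but only one tag comes back, so looking up
        // the tag for token index 1 would fail as in the stack trace.
        System.out.println(fakeTag(stripBrackets(sentence)).size());

        // With the placeholder, there is one tag per token again.
        System.out.println(fakeTag(stripBracketsSafely(sentence)).size());
    }
}
```

Under that assumption, the first call yields a single tag for three tokens, and the second yields three.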