Closed jiehanzheng closed 11 years ago
I think this is indeed a JDK (or maybe java version) incompatibility. I use:
java version "1.6.0_51"
Java(TM) SE Runtime Environment (build 1.6.0_51-b11-457-10M4509)
Java HotSpot(TM) 64-Bit Server VM (build 20.51-b01-457, mixed mode)
Looking into it...
In the meantime, here's a jar built from https://github.com/sammthomson/semafor/tree/a25f817027463923ea21166b2f43464722273fe8 :
https://www.dropbox.com/s/73xpcra0w3w0oyt/Semafor-3.0-alpha-04.jar
@sammthomson Thanks for the JAR file! Let me know if I can help on testing / debugging.
Cool. That was fast. Thank you!
when I try to run semafor with this jar file it stops in the Converting postagged input to conll phase.
Environment variables: SEMAFOR_HOME=/opt/semafor CLASSPATH=.:/opt/semafor/target/Semafor-3.0-alpha-04.jar JAVA_HOME_BIN=/usr/lib/jvm/java-6-oracle/bin MALT_MODEL_DIR=/opt/semafor_malt_model_20121129 TEMP_DIR: /tmp/semafor.oHswfdoPiw Environment variables: SEMAFOR_HOME=/opt/semafor CLASSPATH=.:/opt/semafor/target/Semafor-3.0-alpha-04.jar JAVA_HOME_BIN=/usr/lib/jvm/java-6-oracle/bin MALT_MODEL_DIR=/opt/semafor_malt_model_20121129 Environment variables: SEMAFOR_HOME=/opt/semafor CLASSPATH=.:/opt/semafor/target/Semafor-3.0-alpha-04.jar JAVA_HOME_BIN=/usr/lib/jvm/java-6-oracle/bin MALT_MODEL_DIR=/opt/semafor_malt_model_20121129
Tokenizing file: Data/Cause.txt
real 0m0.039s user 0m0.000s sys 0m0.000s Finished tokenization.
Part-of-speech tagging tokenized data.... /opt/semafor/scripts/jmx /opt/semafor/bin Read 11692 items from tagger.project/word.voc Read 45 items from tagger.project/tag.voc Read 42680 items from tagger.project/tagfeatures.contexts Read 42680 contexts, 117558 numFeatures from tagger.project/tagfeatures.fmap Read model tagger.project/model : numPredictions=45, numParams=117558 Read tagdict from tagger.project/tagdict This is MXPOST (Version 1.0) Copyright (c) 1997 Adwait Ratnaparkhi Sentence: 0 Length: 1 Elapsed Time: 0.024 seconds. Sentence: 1 Length: 0 Elapsed Time: 0.0 seconds.
real 0m1.937s user 0m0.800s sys 0m0.048s /opt/semafor/bin Finished part-of-speech tagging tokenized data.
Converting postagged input to conll. Exception in thread "main" java.lang.IllegalArgumentException: at edu.cmu.cs.lti.ark.fn.data.prep.formats.SentenceCodec.decode(SentenceCodec.java:83) at edu.cmu.cs.lti.ark.fn.data.prep.formats.SentenceCodec$SentenceIterator.computeNext(SentenceCodec.java:115) at edu.cmu.cs.lti.ark.fn.data.prep.formats.SentenceCodec$SentenceIterator.computeNext(SentenceCodec.java:100) at com.google.common.collect.AbstractIterator.tryToComputeNext(AbstractIterator.java:143) at com.google.common.collect.AbstractIterator.hasNext(AbstractIterator.java:138) at edu.cmu.cs.lti.ark.fn.data.prep.formats.ConvertFormat.convertStream(ConvertFormat.java:94) at edu.cmu.cs.lti.ark.fn.data.prep.formats.ConvertFormat.main(ConvertFormat.java:76) Caused by: java.lang.IllegalArgumentException: PosToken must have 2 "_"-separated fields at com.google.common.base.Preconditions.checkArgument(Preconditions.java:92) at edu.cmu.cs.lti.ark.fn.data.prep.formats.Token.fromPosTagged(Token.java:248) at edu.cmu.cs.lti.ark.fn.data.prep.formats.SentenceCodec$2.decodeToken(SentenceCodec.java:28) at edu.cmu.cs.lti.ark.fn.data.prep.formats.SentenceCodec.decode(SentenceCodec.java:79) ... 6 more
Any help you can give will be greatly appreciated.
This comment is a duplicate of #20, let's discuss over there.
So I just checked out semafor and tried to build it using Maven:
Could this be a JDK incompatibility?
I am new to Java, so maybe I am doing something wrong here. If anyone could point out the problem (or share a compiled JAR with me for now first) that would be wonderful! Thanks!