Noahs-ARK / semafor

http://www.ark.cs.cmu.edu/SEMAFOR
GNU General Public License v3.0
96 stars 46 forks source link

Error: could not match input #9

Closed Liontooth closed 10 years ago

Liontooth commented 10 years ago

I'm running Semafor-3.0-alpha-04 and it's generally working great.

In this instance, the conll files gets created correctly (I can include it if that would help in debugging), but the frame-semantic parsing fails with

Error: could not match input

What could be going wrong?

Cheers, David

2014-09-01_1000_US_MSNBC_Meet_the_Press.conll


Performing frame-semantic parsing. input-file:/tmp/semafor.ajYsvERezG/conll output-file:/sweep/2014/2014-09/2014-09-01/2014-09-01_1000_US_MSNBC_Meet_the_Press.json model-dir:/mnt/tvspare/software/java/semafor-experimental/semafor/models/semafor_malt_model_20121129 numthreads:5 Initializing frame identification model... Reading serialized required data Done reading serialized required data Reading graph from: /mnt/tvspare/software/java/semafor-experimental/semafor/models/semafor_malt_model_20121129/sparsegraph.gz... Read graph successfully. Reading model parameters... 100000 200000 300000 400000 500000 600000 700000 800000 900000 1000000 1100000 1200000 1300000 1400000 1500000 1600000 1700000 1800000 1900000 2000000 2100000 2200000 2300000 2400000 2500000 2600000 2700000 2800000 2900000 3000000 Done reading model parameters. Initializing alphabet for argument identification.. 0 100000 200000 300000 400000 500000 600000 700000 800000 900000 1000000 1100000 1200000 1300000 1400000 1500000 1600000 1700000 1800000 1900000 2000000 2100000 2200000 2300000 2400000 2500000 2600000 2700000 2800000 2900000 3000000 parsed sentence 4 in 606 millis. parsed sentence 1 in 1009 millis. parsed sentence 5 in 531 millis. parsed sentence 0 in 1248 millis. parsed sentence 6 in 346 millis. parsed sentence 2 in 1366 millis. parsed sentence 8 in 262 millis. parsed sentence 10 in 191 millis. parsed sentence 11 in 89 millis. parsed sentence 9 in 377 millis. parsed sentence 7 in 595 millis. parsed sentence 15 in 69 millis. parsed sentence 14 in 95 millis. parsed sentence 16 in 311 millis. parsed sentence 13 in 617 millis. parsed sentence 12 in 673 millis. parsed sentence 20 in 2 millis. parsed sentence 18 in 136 millis. parsed sentence 21 in 36 millis. parsed sentence 22 in 23 millis. parsed sentence 17 in 552 millis. parsed sentence 24 in 126 millis. parsed sentence 26 in 22 millis. parsed sentence 23 in 159 millis. parsed sentence 27 in 68 millis. parsed sentence 29 in 23 millis. parsed sentence 25 in 177 millis. parsed sentence 28 in 241 millis. parsed sentence 19 in 1440 millis. parsed sentence 3 in 4081 millis. parsed sentence 33 in 8 millis. parsed sentence 31 in 20 millis. parsed sentence 30 in 30 millis. parsed sentence 34 in 29 millis. parsed sentence 32 in 84 millis. parsed sentence 35 in 88 millis. parsed sentence 40 in 0 millis. parsed sentence 39 in 12 millis. parsed sentence 42 in 1 millis. parsed sentence 43 in 0 millis. parsed sentence 44 in 4 millis. parsed sentence 45 in 61 millis. parsed sentence 36 in 101 millis. parsed sentence 41 in 120 millis. parsed sentence 46 in 52 millis. parsed sentence 37 in 146 millis. parsed sentence 50 in 2 millis. parsed sentence 38 in 0 millis. parsed sentence 47 in 442 millis. parsed sentence 53 in 11 millis. parsed sentence 49 in 442 millis. parsed sentence 52 in 77 millis. parsed sentence 56 in 1 millis. parsed sentence 57 in 55 millis. parsed sentence 48 in 518 millis. parsed sentence 51 in 497 millis. parsed sentence 58 in 54 millis. parsed sentence 54 in 155 millis. parsed sentence 59 in 89 millis. parsed sentence 62 in 33 millis. parsed sentence 61 in 55 millis. parsed sentence 65 in 20 millis. parsed sentence 66 in 6 millis. parsed sentence 55 in 207 millis. parsed sentence 64 in 51 millis. parsed sentence 69 in 9 millis. parsed sentence 63 in 119 millis. parsed sentence 67 in 89 millis. parsed sentence 68 in 93 millis. parsed sentence 72 in 34 millis. parsed sentence 70 in 93 millis. parsed sentence 74 in 16 millis. parsed sentence 76 in 7 millis. parsed sentence 75 in 29 millis. parsed sentence 77 in 21 millis. parsed sentence 73 in 67 millis. parsed sentence 79 in 100 millis. parsed sentence 71 in 223 millis. parsed sentence 81 in 40 millis. parsed sentence 83 in 48 millis. parsed sentence 82 in 58 millis. parsed sentence 84 in 49 millis. parsed sentence 86 in 7 millis. parsed sentence 85 in 63 millis. parsed sentence 78 in 294 millis. parsed sentence 80 in 405 millis. parsed sentence 60 in 941 millis. parsed sentence 90 in 0 millis. parsed sentence 89 in 4 millis. parsed sentence 93 in 0 millis. parsed sentence 88 in 6 millis. parsed sentence 87 in 13 millis. parsed sentence 92 in 17 millis. parsed sentence 97 in 11 millis. parsed sentence 94 in 36 millis. parsed sentence 95 in 36 millis. parsed sentence 99 in 0 millis. parsed sentence 100 in 10 millis. parsed sentence 101 in 17 millis. parsed sentence 98 in 44 millis. parsed sentence 102 in 26 millis. parsed sentence 96 in 105 millis. parsed sentence 104 in 42 millis. parsed sentence 103 in 58 millis. parsed sentence 107 in 0 millis. parsed sentence 106 in 6 millis. parsed sentence 108 in 49 millis. parsed sentence 112 in 5 millis. parsed sentence 113 in 7 millis. parsed sentence 91 in 218 millis. parsed sentence 111 in 55 millis. parsed sentence 116 in 17 millis. parsed sentence 114 in 88 millis. parsed sentence 109 in 150 millis. parsed sentence 117 in 49 millis. parsed sentence 118 in 52 millis. parsed sentence 121 in 9 millis. parsed sentence 122 in 0 millis. parsed sentence 120 in 60 millis. parsed sentence 119 in 163 millis. parsed sentence 115 in 233 millis. parsed sentence 123 in 126 millis. parsed sentence 126 in 9 millis. parsed sentence 125 in 96 millis. parsed sentence 128 in 70 millis. parsed sentence 129 in 15 millis. parsed sentence 130 in 23 millis. parsed sentence 124 in 215 millis. parsed sentence 131 in 23 millis. parsed sentence 127 in 121 millis. parsed sentence 105 in 956 millis. java.util.concurrent.ExecutionException: java.lang.Error: Error: could not match input at java.util.concurrent.FutureTask.report(FutureTask.java:122) at java.util.concurrent.FutureTask.get(FutureTask.java:188) at edu.cmu.cs.lti.ark.fn.Semafor$2.run(Semafor.java:176) at java.lang.Thread.run(Thread.java:744) Caused by: java.lang.Error: Error: could not match input at uk.ac.susx.informatics.Morpha.zzScanError(Morpha.java:52702) at uk.ac.susx.informatics.Morpha.next(Morpha.java:54582) at edu.cmu.cs.lti.ark.util.nlp.MorphaLemmatizer.getLemma(MorphaLemmatizer.java:23) at edu.cmu.cs.lti.ark.util.nlp.Lemmatizer$1.apply(Lemmatizer.java:17) at edu.cmu.cs.lti.ark.util.nlp.Lemmatizer$1.apply(Lemmatizer.java:15) at com.google.common.collect.Lists$TransformingRandomAccessList.get(Lists.java:491) at java.util.AbstractList$Itr.next(AbstractList.java:358) at java.util.AbstractCollection.toArray(AbstractCollection.java:141) at com.google.common.collect.ImmutableList.copyFromCollection(ImmutableList.java:284) at com.google.common.collect.ImmutableList.copyOf(ImmutableList.java:253) at com.google.common.collect.ImmutableList.copyOf(ImmutableList.java:223) at edu.cmu.cs.lti.ark.fn.data.prep.formats.Sentence.(Sentence.java:25) at edu.cmu.cs.lti.ark.util.nlp.Lemmatizer.addLemmas(Lemmatizer.java:15) at edu.cmu.cs.lti.ark.fn.Semafor.addLemmas(Semafor.java:330) at edu.cmu.cs.lti.ark.fn.Semafor.parseSentence(Semafor.java:225) at edu.cmu.cs.lti.ark.fn.Semafor$3.call(Semafor.java:199) at edu.cmu.cs.lti.ark.fn.Semafor$3.call(Semafor.java:195) at java.util.concurrent.FutureTask.run(FutureTask.java:262) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) ... 1 more Exception in thread "Thread-2" java.lang.RuntimeException: java.util.concurrent.ExecutionException: java.lang.Error: Error: could not match input at edu.cmu.cs.lti.ark.fn.Semafor$2.run(Semafor.java:182) at java.lang.Thread.run(Thread.java:744) Caused by: java.util.concurrent.ExecutionException: java.lang.Error: Error: could not match input at java.util.concurrent.FutureTask.report(FutureTask.java:122) at java.util.concurrent.FutureTask.get(FutureTask.java:188) at edu.cmu.cs.lti.ark.fn.Semafor$2.run(Semafor.java:176) ... 1 more Caused by: java.lang.Error: Error: could not match input at uk.ac.susx.informatics.Morpha.zzScanError(Morpha.java:52702) at uk.ac.susx.informatics.Morpha.next(Morpha.java:54582) at edu.cmu.cs.lti.ark.util.nlp.MorphaLemmatizer.getLemma(MorphaLemmatizer.java:23) at edu.cmu.cs.lti.ark.util.nlp.Lemmatizer$1.apply(Lemmatizer.java:17) at edu.cmu.cs.lti.ark.util.nlp.Lemmatizer$1.apply(Lemmatizer.java:15) at com.google.common.collect.Lists$TransformingRandomAccessList.get(Lists.java:491) at java.util.AbstractList$Itr.next(AbstractList.java:358) at java.util.AbstractCollection.toArray(AbstractCollection.java:141) at com.google.common.collect.ImmutableList.copyFromCollection(ImmutableList.java:284) at com.google.common.collect.ImmutableList.copyOf(ImmutableList.java:253) at com.google.common.collect.ImmutableList.copyOf(ImmutableList.java:223) at edu.cmu.cs.lti.ark.fn.data.prep.formats.Sentence.(Sentence.java:25) at edu.cmu.cs.lti.ark.util.nlp.Lemmatizer.addLemmas(Lemmatizer.java:15) at edu.cmu.cs.lti.ark.fn.Semafor.addLemmas(Semafor.java:330) at edu.cmu.cs.lti.ark.fn.Semafor.parseSentence(Semafor.java:225) at edu.cmu.cs.lti.ark.fn.Semafor$3.call(Semafor.java:199) at edu.cmu.cs.lti.ark.fn.Semafor$3.call(Semafor.java:195) at java.util.concurrent.FutureTask.run(FutureTask.java:262) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) ... 1 more parsed sentence 135 in 8 millis. parsed sentence 136 in 44 millis. parsed sentence 132 in 75 millis. parsed sentence 134 in 109 millis. parsed sentence 133 in 594 millis.

sammthomson commented 10 years ago

Thanks for the bug report! This is fixed in the develop branch, but not yet merged into master. I'll try to get it into master ASAP, but in the meantime you could try using develop...

Liontooth commented 10 years ago

Hey Sam -- very cool! But don't we need some of the changes to master, such as "XML conversion to JSON was missing extra FE layers", made after develop forked?

Cheers, Dave

Liontooth commented 10 years ago

Resolved with Morpha bugfix Sep 10, 2014 -- thanks!