The default in Collins' evalb bracket-evaluation program is to ignore empty
parses, and the BUBS reimplementation of evalb does the same. Revision 8e75def
adds a configuration option (-O evalParseFailures=true) that includes empty /
failed parses in bracket evaluation. A failed sentence then contributes its gold
brackets but no matched brackets, which penalizes recall.
Note: Since this defect was filed, other changes and bug fixes have eliminated
the parse failure that the reported command line produced. The examples below
trigger that failure again and demonstrate the effect of
-O evalParseFailures=true.
$ echo -e "(ROOT (SBAR (ADVP (DT The)) (WHADVP (WRB why)) (S (X (SYM x)) (X (FW w) (FW v) (NP (PRP you)) (NN tee)) (NP (PRP s)) (VP (VBP are) (S (INTJ (JJ queue) (NN pea) (INTJ (UH oh))) (INTJ (JJ n) (NN m) (VB el) (VB que) (VB jay)) (PRN (S (NP (PRP I)) (VP (VBP age) (ADVP (JJ gee)) (SBAR (IN if) (X (SYM e)))))) (X (SYM thee)) (VP (VBP see) (NP (NN b) (NN a.))))))))\n(ROOT (S (NP (DT This)) (VP (MD should) (VP (VB be) (ADJP (JJ easy)))) (. .)))" |
build-dist/parse -g models/eng.sm6.gr.gz -fom models/eng.sm6.fom.gz -beamModel models/eng.sm6.bcm.gz -if Tree -O beamModelBias="-200,-200,-200,-200" -O maxLocalDelta=5
INFO: parser=CartesianProductHashMl fom=models/eng.sm6.fom.gz decode=ViterbiMax
INFO: -g models/eng.sm6.gr.gz -fom models/eng.sm6.fom.gz -beamModel
models/eng.sm6.bcm.gz -if Tree -O beamModelBias=-200,-200,-200,-200 -O
maxLocalDelta=5
()
(ROOT (S (NP (DT This)) (VP (MD should) (VP (VB be) (ADJP (JJ easy)))) (. .)))
INFO: numSentences=2 numFail=1 reparsedSentences=0 totalReparses=0
totalSeconds=0.165 cpuSeconds=0.165 avgSecondsPerSent=0.083 wordsPerSec=187.879
f1=100.00 prec=100.00 recall=100.00
$ echo -e "(ROOT (SBAR (ADVP (DT The)) (WHADVP (WRB why)) (S (X (SYM x)) (X (FW w) (FW v) (NP (PRP you)) (NN tee)) (NP (PRP s)) (VP (VBP are) (S (INTJ (JJ queue) (NN pea) (INTJ (UH oh))) (INTJ (JJ n) (NN m) (VB el) (VB que) (VB jay)) (PRN (S (NP (PRP I)) (VP (VBP age) (ADVP (JJ gee)) (SBAR (IN if) (X (SYM e)))))) (X (SYM thee)) (VP (VBP see) (NP (NN b) (NN a.))))))))\n(ROOT (S (NP (DT This)) (VP (MD should) (VP (VB be) (ADJP (JJ easy)))) (. .)))" |
build-dist/parse -g models/eng.sm6.gr.gz -fom models/eng.sm6.fom.gz -beamModel models/eng.sm6.bcm.gz -if Tree -O beamModelBias="-200,-200,-200,-200" -O maxLocalDelta=5 -O evalParseFailures=true
INFO: parser=CartesianProductHashMl fom=models/eng.sm6.fom.gz decode=ViterbiMax
INFO: -g models/eng.sm6.gr.gz -fom models/eng.sm6.fom.gz -beamModel
models/eng.sm6.bcm.gz -if Tree -O beamModelBias=-200,-200,-200,-200 -O
maxLocalDelta=5 -O evalParseFailures=true
()
(ROOT (S (NP (DT This)) (VP (MD should) (VP (VB be) (ADJP (JJ easy)))) (. .)))
INFO: numSentences=2 numFail=1 reparsedSentences=0 totalReparses=0
totalSeconds=0.212 cpuSeconds=0.212 avgSecondsPerSent=0.106 wordsPerSec=146.226
f1=33.33 prec=100.00 recall=20.00
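The two sets of scores can be reproduced by hand. The Python sketch below is not the BUBS evaluator; it is a minimal illustration that assumes every non-preterminal node (including ROOT) counts as one labeled bracket and that no punctuation or label filtering is applied. Under those assumptions the failed sentence has 24 gold brackets and the parsed sentence has 6, so including the failure gives recall 6/30 = 20%. The names labeled_brackets and evaluate are hypothetical helpers made up for this example.

```python
import re
from collections import Counter

FAILED_GOLD = (
    "(ROOT (SBAR (ADVP (DT The)) (WHADVP (WRB why)) (S (X (SYM x)) "
    "(X (FW w) (FW v) (NP (PRP you)) (NN tee)) (NP (PRP s)) (VP (VBP are) "
    "(S (INTJ (JJ queue) (NN pea) (INTJ (UH oh))) "
    "(INTJ (JJ n) (NN m) (VB el) (VB que) (VB jay)) "
    "(PRN (S (NP (PRP I)) (VP (VBP age) (ADVP (JJ gee)) "
    "(SBAR (IN if) (X (SYM e)))))) "
    "(X (SYM thee)) (VP (VBP see) (NP (NN b) (NN a.))))))))"
)
EASY = "(ROOT (S (NP (DT This)) (VP (MD should) (VP (VB be) (ADJP (JJ easy)))) (. .)))"


def labeled_brackets(tree):
    """Return a multiset of (label, start, end) for every non-preterminal node.

    Assumption: ROOT and all other internal nodes count; preterminals do not.
    An empty parse "()" yields no brackets.
    """
    brackets = Counter()
    stack = []  # entries are [label, start_word_index, has_child_node]
    word_index = 0
    for token in re.findall(r"\(|\)|[^\s()]+", tree):
        if token == "(":
            if stack:
                stack[-1][2] = True  # the parent has a node as a child
            stack.append([None, word_index, False])
        elif token == ")":
            label, start, has_child = stack.pop()
            if has_child:
                brackets[(label, start, word_index)] += 1
        elif stack[-1][0] is None:
            stack[-1][0] = token  # first atom after "(" is the node label
        else:
            word_index += 1  # any later atom is a word
    return brackets


def evaluate(gold_trees, parsed_trees, eval_parse_failures):
    """Return (precision, recall, f1) as percentages over all sentences."""
    matched = gold_total = parsed_total = 0
    for gold, parsed in zip(gold_trees, parsed_trees):
        gold_b, parsed_b = labeled_brackets(gold), labeled_brackets(parsed)
        if not parsed_b and not eval_parse_failures:
            continue  # default behavior: ignore empty / failed parses
        matched += sum((gold_b & parsed_b).values())
        gold_total += sum(gold_b.values())
        parsed_total += sum(parsed_b.values())
    precision = 100.0 * matched / parsed_total if parsed_total else 0.0
    recall = 100.0 * matched / gold_total if gold_total else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


gold = [FAILED_GOLD, EASY]
parsed = ["()", EASY]  # parser output from the transcripts above
for flag in (False, True):
    p, r, f = evaluate(gold, parsed, flag)
    print(f"evalParseFailures={flag}: f1={f:.2f} prec={p:.2f} recall={r:.2f}")
```

Precision stays at 100% in both runs because the empty parse proposes no brackets, so only the recall denominator grows when the failed sentence is included.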
Original comment by aaron.du...@gmail.com on 30 May 2013 at 7:26
Original issue reported on code.google.com by adamteic...@gmail.com on 9 Jan 2013 at 10:26