Open GoogleCodeExporter opened 8 years ago
Add:
I get the same message:
--------
IFLAG= -1
LINE SEARCH FAILED. SEE DOCUMENTATION OF ROUTINE MCSRCH
ERROR RETURN OF LINE SEARCH: INFO= 3
POSSIBLE CAUSES: FUNCTION OR GRADIENT ARE INCORRECT
OR INCORRECT TOLERANCES
--------
when I use different data sets and count merging for interpolation. I used the
following commands:
estimate-ngram -order 3 -v wlist -unk true -t train.txt -opt-perp opt.txt -wl arpa_a.gz -wec arpa_a.effcounts
...
gzip -d -f arpa_a.gz
...
interpolate-ngram -l "arpa_a, arpa_b, arpa_c" -interpolation CM -opt-perp ip.txt -wl arpa_full.gz
Original comment by sebastia...@googlemail.com
on 25 Oct 2010 at 2:55
These error messages come from the optimizer's line search (the MCSRCH routine
named in the output). They suggest that the search surface is so flat that
changes in the interpolation weights produce only random fluctuations in the
objective, caused by limited numerical precision. As you pointed out, the
resulting weights are still usable and yield a valid LM, so it should be okay
to ignore these warnings.
In this particular case, are the component LMs trained from small text corpora
or with little overlap in vocabulary? Is the tuning set particularly small?
If so, this may explain why the interpolated model is not sensitive to the
interpolation parameters. To test this hypothesis, you can try specifying the
-params argument to interpolate-ngram and comparing the perplexity of the
tuning data under different interpolation weights.
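To make the "flat surface" idea concrete, here is a minimal Python sketch of
such a weight sweep. It is not MITLM code: it assumes plain linear
interpolation of two models rather than count merging (CM), and the per-token
probabilities and the helper names (perplexity, sweep, spread) are made up for
illustration. If the spread in perplexity across the grid is close to zero,
the optimizer has essentially nothing to follow.

```python
import math


def perplexity(probs_per_model, weights):
    """Perplexity of a tuning set under a linear mixture of component models.

    probs_per_model[k][i] is the probability model k assigns to token i.
    """
    n_tokens = len(probs_per_model[0])
    log_sum = 0.0
    for i in range(n_tokens):
        # Mixture probability of token i under the given weights.
        p = sum(w * model[i] for w, model in zip(weights, probs_per_model))
        log_sum += math.log(p)
    return math.exp(-log_sum / n_tokens)


def sweep(probs_per_model, step=0.25):
    """Evaluate perplexity on a grid of weights (w, 1 - w) for two models."""
    n_steps = int(round(1.0 / step))
    results = []
    for j in range(n_steps + 1):
        w = j * step
        results.append((w, perplexity(probs_per_model, [w, 1.0 - w])))
    return results


def spread(results):
    """Difference between the worst and best perplexity on the grid."""
    ppls = [ppl for _, ppl in results]
    return max(ppls) - min(ppls)


# Hypothetical per-token probabilities on a four-token tuning set.
# "similar": both components agree, so the weights cannot matter.
similar = [[0.10, 0.20, 0.05, 0.30], [0.10, 0.20, 0.05, 0.30]]
# "different": the components disagree, so the weights matter a lot.
different = [[0.10, 0.20, 0.05, 0.30], [0.30, 0.01, 0.20, 0.02]]

for name, data in (("similar", similar), ("different", different)):
    print(name, "spread in perplexity:", spread(sweep(data)))
    for w, ppl in sweep(data):
        print("  w =", w, "perplexity =", ppl)
```

The two toy cases only show what a flat versus a curved surface looks like;
they are not meant to reproduce your setup. With the real models you would
run the same kind of sweep by evaluating the tuning data at a few fixed
weight settings and comparing the reported perplexities.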
Original comment by bojune...@gmail.com
on 27 Oct 2010 at 3:25
Original issue reported on code.google.com by
sebastia...@googlemail.com
on 25 Oct 2010 at 12:29