singnet / language-learning

OpenCog Unsupervised Language Learning
https://wiki.opencog.org/w/Language_learning
MIT License

LGParseError "Number of sentences in corpus and reference files missmatch". #172

Closed OlegBaskov closed 5 years ago

OlegBaskov commented 5 years ago

Four of the five tests pass.
The fifth (notebook cell 15) crashes with LGParseError: Number of sentences in corpus and reference files missmatch. Reference file '/home/obaskov/94/language-learning/data/GCB/LG-E-clean/GCB-LG-English-clean.ull' does not match its corpus counterpart 104341 != 104340.
In all five tests, the corpus is extracted from the reference file.
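
A rough way to narrow down an off-by-one like 104341 != 104340 is to count sentences on both sides independently and find the first position where they diverge. The sketch below is not the project's parser. It assumes a reference file made of blank-line-separated blocks whose first line is the sentence and whose remaining lines are links, and a corpus with one sentence per non-empty line. The function names `reference_sentences`, `corpus_sentences`, and `first_mismatch` are hypothetical helpers introduced for illustration only.

```python
from typing import List, Optional


def reference_sentences(text: str) -> List[str]:
    """Return the first line of every blank-line-separated block.

    Assumption: each block is one parsed sentence followed by its link lines.
    """
    sentences: List[str] = []
    in_block = False
    for line in text.splitlines():
        if not line.strip():
            in_block = False          # a blank line ends the current block
        elif not in_block:
            sentences.append(line.strip())
            in_block = True           # remaining lines of the block are links
    return sentences


def corpus_sentences(text: str) -> List[str]:
    """Return every non-empty line (assumption: one sentence per line)."""
    return [line.strip() for line in text.splitlines() if line.strip()]


def first_mismatch(ref: List[str], corpus: List[str]) -> Optional[int]:
    """Return the 0-based index of the first differing sentence, or None."""
    for i, (r, c) in enumerate(zip(ref, corpus)):
        if r != c:
            return i
    if len(ref) != len(corpus):
        return min(len(ref), len(corpus))  # one side simply has extra sentences
    return None


if __name__ == "__main__":
    # Tiny made-up inputs, not taken from the GCB data.
    ref_text = "a b\n0 LW 1 a\n1 a 2 b\n\nc d\n0 LW 1 c\n\ne f\n0 LW 1 e\n"
    corpus_text = "a b\nc d\n"
    ref = reference_sentences(ref_text)
    corpus = corpus_sentences(corpus_text)
    print(len(ref), len(corpus), first_mismatch(ref, corpus))
```

If the two counts differ by exactly one, the index returned by `first_mismatch` points at the first sentence that one side has and the other lacks, which may indicate a sentence that was dropped or split during corpus extraction.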

Static HTML copy of the notebook -- GCB-LG-E-clean-ALE-MWC=1-MSL=10-2019-02-17_LGParseError.html, error in cell 15.
The faulty grammar directory -- GCB-LG-E-clean-ALE-MWC=1-MSL=10-2019-02-17_LGParseError/GCB_LG-E-clean_cALWEd_no-gen_20c/