divvun / libdivvun

lib for running gramcheck and other pipelines + cli; modules for CG→spelling, CG→feedback, tagging blanks
https://giellalt.github.io/proof/gramcheck/GrammarCheckerDocumentation.html
GNU General Public License v3.0
9 stars 1 forks source link

cgspell <spellskip> unpredictable and untoggleable #16

Closed TinoDidriksen closed 5 years ago

TinoDidriksen commented 5 years ago

Pipe: echo ... | kal-tokenise | divvun-cgspell -n 5 kl.zhfst

Input illu oqaaseq Vester wjgxyzæøå xyzæøå. yields output with <spellskip>:

"<illu>"
        "ih" Interj LU
        "illu" N Abs Sg
"<oqaaseq>"
        "oqaaseq" N Abs Sg
        "oqar" Gram/IV USIQ Der/vn N Abs Sg
        "oqar" Gram/TV Gram/Refl USIQ Der/vn N Abs Sg
"<Vester>"
        "Vester" ?
        "Vester" ? <spellskip>
"<wjgxyzæøå>"
        "wjgxyzæøå" ?
        "wjgxyzæøå" ? <spellskip>
"<xyzæøå>"
        "xyzæøå" ?
        "xyzæøå" ? <spellskip>
"<.>"
        "." CLB

But input Vester wjgxyzæøå xyzæøå. by itself does not give <spellskip>:

"<Vester>"
        "Vester" ?
        "mester" <W:10> <WA:0> <spelled> "<mester>"
        "bister" <W:20> <WA:0> <spelled> "<bister>"
        "center" <W:20> <WA:0> <spelled> "<center>"
        "meter" <W:20> <WA:0> <spelled> "<meter>"
        "festeq" <W:20> <WA:0> <spelled> "<festeq>"
"<wjgxyzæøå>"
        "wjgxyzæøå" ?
"<xyzæøå>"
        "xyzæøå" ?
        "xxxyzæøå" <W:20> <WA:0> <spelled> "<xxxyzæøå>"
        "xxxzæøå" <W:20> <WA:0> <spelled> "<xxxzæøå>"
        "xxxæøå" <W:20> <WA:0> <spelled> "<xxxæøå>"

Neither does it <spellskip> if the input is illu Vester wjgxyzæøå xyzæøå. or Vester wjgxyzæøå xyzæøå illu.

I would like a way to disable this <spellskip> logic, because it is not cgspell's job to potentially detect foreign quotes or whatever it is trying to do.