bootphon / wordseg

A Python toolbox for text based word segmentation
https://docs.cognitive-ml.fr/wordseg
GNU General Public License v3.0

Syllabification #4 - separators #39

Closed: GladB closed this issue 6 years ago

GladB commented 6 years ago

Using different phone and word separators fails (for example, with the parameters -p ' ' -w ';eword'). The only case that works for me is when the file has no phone separator and uses spaces as the word separator (in which case I use "-p '_' -w ' '"). It would also be useful to be able to specify the syllable separator used in the output (it is ";esyll" for now).

mmmaat commented 6 years ago
GladB commented 6 years ago
mmmaat commented 6 years ago

I fixed the ;eword;eword problem.

Spaces are supported as syllable separators:

$ echo "n;o;word" | wordseg-syll -p ";" -s " " -w "word" cspanish_onsets.txt cspanish_vowels.txt
n;o; word
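
To make the separator layout in that output easier to follow, here is a minimal Python sketch. It is not the wordseg-syll implementation and it does no syllabification (no onsets or vowels files are used). The function names parse_utterance and join_syllables are hypothetical, the separators are assumed to be non-empty strings, and the syllable grouping is supplied by hand.

```python
def parse_utterance(utterance, phone_sep, word_sep):
    """Split an utterance into words, each word being a list of phones.

    Hypothetical helper: assumes both separators are non-empty strings.
    """
    words = []
    for chunk in utterance.split(word_sep):
        # Drop the empty strings left by trailing separators or padding.
        phones = [p for p in chunk.split(phone_sep) if p]
        if phones:
            words.append(phones)
    return words


def join_syllables(words, phone_sep, syll_sep, word_sep):
    """Render words (lists of syllables, each a list of phones) as text.

    Every phone is followed by phone_sep, every syllable by syll_sep,
    and every word by word_sep.
    """
    out = []
    for syllables in words:
        for syllable in syllables:
            out.append("".join(phone + phone_sep for phone in syllable))
            out.append(syll_sep)
        out.append(word_sep)
    return "".join(out)


# Same separators as the command above: -p ";" -s " " -w "word"
words = parse_utterance("n;o;word", ";", "word")
# Assumption for illustration: the word "n o" forms a single syllable.
syllabified = [[phones] for phones in words]
print(join_syllables(syllabified, ";", " ", "word"))
```

Under these assumptions the sketch prints the same string as the command above, which shows that the space is the syllable boundary marker placed right before the word separator.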