apertium / lttoolbox

Finite state compiler, processor and helper tools used by apertium
http://wiki.apertium.org/wiki/Lttoolbox
GNU General Public License v2.0
18 stars 22 forks source link

Format of epsilon symbol from lt-print #92

Open ftyers opened 4 years ago

ftyers commented 4 years ago

This is a bit bikeshedding, but I think that we should change the epsilon symbol in lt-print from ε either to <ε> or to @0@. It probably needs to change if we're ever going to deal with Greek, although it's very low priority.

My feeling is that <ε> is more Apertiumy. This could also be updated in HFST's hfst-txt2fst tool so that the -e option is not necessary.

mr-martian commented 4 years ago

Though we do already have lt-print -H to switch from ε to @0@.

mr-martian commented 3 years ago

I suppose one way to handle the Greek issue in lt-comp would be to have an option to tell it to use HFST symbols instead of lt-print symbols so that it won't interpret ε.

mr-martian commented 3 years ago

I just ran into this issue, so now in the branch of #119 lt-comp -H will make ε be a character rather than being treated as an empty transition.