we still have cases were names 'A.B.Fulano' in a single token
we may have other tokens that need to be split. We can search for . or - inside token forms.
we have some cases of WF tokens with sep=`, the space is the default sep, need to remove those cases and check if the detokenization approach still works matching thetext` field.
after #9
.
or-
inside token forms.`, the space is the default sep, need to remove those cases and check if the detokenization approach still works matching the
text` field.