Segmentation currently happens in `orthography.py`. To apply cluster-wise segmentation, all it would take is adding `from ipatok import clusterise` and replacing `tokenise` with `clusterise` in line 12. What I don't know is how to make that column eventually appear in `forms.csv`.
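One possible workaround, as a rough sketch only: post-process the CSV and append the segmentation as an extra column. The column names `Form` and `Clusters` and both helper functions are assumptions made up for illustration, not part of `orthography.py` or of any existing pipeline. The segmenter is passed in as a plain callable, so `clusterise` could be swapped in for it.

```python
import csv
import io


def add_segment_column(rows, segment, source="Form", target="Clusters"):
    """Return copies of the rows with a space-joined segmentation in `target`.

    `segment` is any callable that maps a string to a list of segments.
    `source` and `target` are hypothetical column names.
    """
    out = []
    for row in rows:
        new_row = dict(row)
        new_row[target] = " ".join(segment(row[source]))
        out.append(new_row)
    return out


def rewrite_csv(text, segment, source="Form", target="Clusters"):
    """Read CSV text, add the `target` column, and return the new CSV text."""
    reader = csv.DictReader(io.StringIO(text))
    rows = add_segment_column(list(reader), segment, source, target)
    fieldnames = list(reader.fieldnames)
    if target not in fieldnames:
        fieldnames.append(target)
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()
```

With ipatok installed, calling `rewrite_csv(text, clusterise)` on the contents of `forms.csv` should add the clusters as a new column. If `forms.csv` is regenerated by a build step, though, the column would have to be added there instead, so this only shows the shape of the data, not where the change belongs.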