Jekub / Wapiti

A simple and fast discriminative sequence labeling toolkit ( http://wapiti.limsi.fr )
Other
251 stars 86 forks source link

Avoiding patterns and using my own features #21

Open usptact opened 7 years ago

usptact commented 7 years ago

Hi,

I have a dataset from which a 3rd party tool extracted features (works very well in CRFSuite). There are multiple string features extracted per word. Is there a way to train a model from such file or I am forced to reimplement the feature extraction using patterns?

When I try to train a model on my data, I get an error message: error: invalid feature: U-hi-WORD|C

The "U-hi-WORD|C" is the first feature for the first word in the first sequence in the utterance.

Thanks

usptact commented 7 years ago

@jekub bump