kawu / concraft-pl

A morphosyntactic tagger for Polish based on conditional random fields
http://zil.ipipan.waw.pl/Concraft
BSD 2-Clause "Simplified" License

Idea: limit possible tags based on shape #2

Open kawu opened 11 years ago

kawu commented 11 years ago

It may be a good idea to limit the possible interpretations of out-of-vocabulary (OOV) words based on their shape. For example, given a word x with shape sh(x), limit the set of x's possible interpretations to the set of tags assigned in the training corpus to words with shape sh(x).
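
A rough sketch of the idea, in Python rather than Concraft's own code, might look like the following. Everything here is an assumption for illustration: the `shape` function (which collapses runs of upper-case, lower-case, digit, and other characters), the names `build_shape_index` and `restrict_tags`, and the simplified tags. The fallback to the unrestricted candidate set when the shape is unseen or the intersection is empty is also an assumption, not something stated above.

```python
from collections import defaultdict


def shape(word):
    """Hypothetical sh(x): map each character to a class and collapse runs.

    u = upper-case, l = lower-case, d = digit, x = anything else.
    """
    out = []
    for ch in word:
        if ch.isupper():
            c = "u"
        elif ch.islower():
            c = "l"
        elif ch.isdigit():
            c = "d"
        else:
            c = "x"
        # Append only when the class changes, so "Warszawa" becomes "ul".
        if not out or out[-1] != c:
            out.append(c)
    return "".join(out)


def build_shape_index(tagged_corpus):
    """Collect, for every shape seen in training, the set of tags it received."""
    index = defaultdict(set)
    for word, tag in tagged_corpus:
        index[shape(word)].add(tag)
    return index


def restrict_tags(word, candidates, index):
    """Keep only the candidate tags that were seen with the word's shape.

    Assumption: if the shape was never seen, or no candidate survives,
    return the candidates unchanged rather than an empty set.
    """
    candidates = set(candidates)
    allowed = index.get(shape(word))
    if not allowed:
        return candidates
    return (candidates & allowed) or candidates


# Toy training data with simplified, made-up tags.
corpus = [
    ("Warszawa", "subst"),
    ("Kraków", "subst"),
    ("idzie", "fin"),
    ("dom", "subst"),
    ("2013", "num"),
]
index = build_shape_index(corpus)
restricted = restrict_tags("Gdańsk", {"subst", "fin", "adj"}, index)
```

In this toy setup, the capitalised OOV word "Gdańsk" has the same shape as "Warszawa" and "Kraków", so only the tag observed with that shape survives. How coarse sh(x) should be is an open design choice: a finer shape restricts more aggressively but leaves more OOV words with an unseen shape.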