summerlight / anlp

Applied Natural Language Processing project
Apache License 2.0
2 stars 2 forks source link

Implement basic LID schemes #8

Closed summerlight closed 8 years ago

summerlight commented 8 years ago

We want to implement (very) basic LID schemes with CRF or structured SVM. Then we can see the result and find out whether it could be improved or not. We'll use PyStruct for this purpose. At the first stage, we don't need a full dataset. Just make some development set by hand (50~ would be suffice), and develop some identifier.

Before developing identifiers, please study the topic and how to use the library idiomatically. Fixing bugs in a legacy code is much harder than writing a new code from scratch, especially for those who are not code owners.