We want to implement (very) basic LID schemes with CRF or structured SVM. Then we can see the result and find out whether it could be improved or not. We'll use PyStruct for this purpose. At the first stage, we don't need a full dataset. Just make some development set by hand (50~ would be suffice), and develop some identifier.
Before developing identifiers, please study the topic and how to use the library idiomatically. Fixing bugs in a legacy code is much harder than writing a new code from scratch, especially for those who are not code owners.
We want to implement (very) basic LID schemes with CRF or structured SVM. Then we can see the result and find out whether it could be improved or not. We'll use PyStruct for this purpose. At the first stage, we don't need a full dataset. Just make some development set by hand (50~ would be suffice), and develop some identifier.
Before developing identifiers, please study the topic and how to use the library idiomatically. Fixing bugs in a legacy code is much harder than writing a new code from scratch, especially for those who are not code owners.