leondz / cavat

Automatically exported from code.google.com/p/cavat
3 stars 1 forks source link

Incorrect element ID format causes text and position to be omitted #72

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
If we create a signal with a sid of "1601" instead of a sid in the format 
<letter s><digits>, the text, position, sentence and inSentence fields are not 
populated.

doc_id  sid text    position    sentence    inSentence 
4   1601    NULL    NULL    NULL    NULL

There are two fixes; in order of thoroughness (and inverse order of ease of 
implementation):
- Allow any element to have any string as its label
- Throw an exception when reading in an element whose label is not in the 
correct format

Original issue reported on code.google.com by leonderczynski on 1 Jan 2011 at 1:39