chilynn / sequence-labeling

307 stars 167 forks source link

Sequential Labeling

Input Format

The first column is the char, the second column is the label(BMEO), there is an empty line between two sentences

N B

B M

A E

D O

an empty line

Z O

Z O

Z O

Z O

Z O

Output Format

NBAD\<@>NBA

ZZZZZ\<@>

Train

python train.py train.in model -v validation.in -c char_emb -e 10 -g 2

Test

python test.py model test.in test.out -c char_emb -g 2

Embedding

The first line of the embedding file is the number of char and embedding dimension, seperating by space, e.g 5 10. The remaining line is the char and embedding vector, seperating by space, e.g N dim1 ... dim 10

Installation Dependencies

References