monikkinom / ner-lstm

Named Entity Recognition using multilayered bidirectional LSTM

get_conll_embeddings.py question #17

gahu1125 closed this issue 7 years ago

gahu1125 commented 7 years ago

For 'get_conll_embeddings.py', it says that I have to pass the locations of train, test_a, test_b, and use_model. train, test_a and test_b are the txt files that I can copy from the data folder, correct? And use_model is the pkl file that I created using wordvec or another embedding model, correct?

If this is the case, how am I actually training the model on the corpus that I ingested at the start? Or does that mean I have to make my own train, test_a and test_b datasets?

shreenibhar commented 7 years ago

You need a different corpus to train the word embeddings. Then use those embeddings in get_conll_embeddings.py, whose input data is available in the data folder.
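
As a rough illustration of that second step, here is a minimal sketch of loading a pickled embedding model and turning CoNLL-style lines into (vector, tag) pairs. It is not the script's actual code: the dict-based `toy_model`, the `embed_conll` helper, the lowercasing, and the zero vector for unknown words are all assumptions made for the example, and the real use_model pickle may be structured differently.

```python
import io
import pickle

# Hypothetical stand-in for embeddings trained on a separate, larger corpus.
# Here the "model" is just a dict from lowercase token to vector (an assumption).
DIM = 3
toy_model = {"eu": [0.1, 0.2, 0.3], "rejects": [0.4, 0.5, 0.6]}
model_bytes = pickle.dumps(toy_model)  # plays the role of the use_model pkl file

# A few CoNLL-style lines: token, POS tag, chunk tag, NER tag.
# A blank line marks the end of a sentence.
conll_text = """EU NNP I-NP I-ORG
rejects VBZ I-VP O
German JJ I-NP I-MISC

"""


def embed_conll(lines, model, dim):
    """Return one list of (vector, ner_tag) pairs per sentence."""
    sentences, current = [], []
    for line in lines:
        parts = line.split()
        if not parts:  # blank line: close the current sentence
            if current:
                sentences.append(current)
                current = []
            continue
        token, tag = parts[0], parts[-1]
        # Tokens missing from the embedding vocabulary get a zero vector.
        vector = model.get(token.lower(), [0.0] * dim)
        current.append((vector, tag))
    if current:  # file did not end with a blank line
        sentences.append(current)
    return sentences


model = pickle.loads(model_bytes)
sentences = embed_conll(io.StringIO(conll_text), model, DIM)
print(len(sentences), "sentence(s),", len(sentences[0]), "tokens")
```

The point of the sketch is that the embedding model is only looked up here, not trained: the vectors come from the separate corpus, while the tokens and NER labels come from the train, test_a and test_b files.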