victoriastuart closed this issue 7 years ago
@victoriastuart Thanks a lot, you just saved me a lot of time!
Hi @victoriastuart @nkruglikov, I am new to Python. Can you please help me train the model using the GoogleNews word embeddings? I am running the following command:
python train.py --train dataset/eng.train --dev dataset/eng.testa --test dataset/eng.testb --lr_method=adam --tag_scheme=iob --pre_emb=GoogleNews-vectors-negative300.bin --all_emb=300
I got this error:
I have been stuck on this issue for about 2 months and have not been able to resolve it. Thanks in advance.
Two issues:
Others (e.g. issues #20, #41) asked what a 'tokenized sentence' is; that puzzled me too. Answer: any sentence is 'tokenized'; e.g.
Victoria was born in 1961 in Halifax, Nova Scotia, Canada.
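To make 'tokenized' concrete, here is a minimal sketch, assuming it means that every token (punctuation included) is separated by whitespace, one sentence per line. The `simple_tokenize` helper and its regex are hypothetical and only for illustration; they are not part of the tagger.

```python
import re

def simple_tokenize(sentence):
    """Split a sentence into word tokens and single punctuation tokens.

    Hypothetical helper: a deliberately naive regex, not the tagger's code.
    """
    # \w+ matches runs of letters/digits; [^\w\s] matches one punctuation mark.
    return re.findall(r"\w+|[^\w\s]", sentence)

raw = "Victoria was born in 1961 in Halifax, Nova Scotia, Canada."
tokenized = " ".join(simple_tokenize(raw))
print(tokenized)
# Victoria was born in 1961 in Halifax , Nova Scotia , Canada .
```

A real tokenizer handles contractions, abbreviations, and hyphenated words more carefully; this only shows the expected shape of the input.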
If your input file contains blank lines, e.g.
then tagger.py | utils.py throws an error:
You can solve that simply by changing the following lines in tagger.py:
Original:
Modified:
Added lines:
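For illustration only, and not the actual change to tagger.py: a minimal sketch of the general idea of skipping blank lines before a sentence is passed on for tagging. The `iter_sentences` helper is a hypothetical name, and the assumption is that each input line is split on whitespace into tokens.

```python
def iter_sentences(lines):
    """Yield (line_number, tokens) for every non-blank input line.

    Hypothetical helper: assumes one whitespace-tokenized sentence per line.
    """
    for line_number, line in enumerate(lines, start=1):
        tokens = line.rstrip().split()
        if not tokens:
            # Blank or whitespace-only line: nothing to tag, so skip it.
            continue
        yield line_number, tokens

# Small in-memory stand-in for an input file that contains blank lines.
sample = [
    "Victoria was born in 1961 .\n",
    "\n",
    "   \n",
    "She lives in Canada .\n",
]
sentences = list(iter_sentences(sample))
```

Keeping the original line number alongside the tokens makes it easy to report which input lines were skipped or to re-insert empty lines in the output.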