UppsalaNLP / uuparser

A transition-based parser for Universal Dependencies with BiLSTM word and character representations.
Apache License 2.0

Continue training on model #25

Closed vlangloisx3 closed 4 years ago

vlangloisx3 commented 4 years ago

I've been having some issues with the --continueParams option. The code that I am using to continue training a model is below:

uuparser --outdir ewt_output --datadir ud-treebanks-v2.6 --include "en_ewt" --continue --continueParams=params.pickle

My assumption is that the params file is in the same folder as the outdir, but I get the error: No such file or directory: 'params.pickle'. I've also tried inserting the actual path, but get the error: UnicodeDecodeError: 'utf-8' codec can't decode byte 0x80 in position 0: invalid start byte.

Since neither of these seems to work, where does the params file need to be in order to load it?

Thanks! Val
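The "No such file or directory" error is what one would expect if a bare filename is resolved against the current working directory rather than against --outdir. The snippet below is a minimal sketch of building an absolute path to pass to --continueParams. The helper name params_path is hypothetical and not part of uuparser, and the <outdir>/<treebank>/params.pickle layout is an assumption taken from the full path quoted in this thread.

```python
from pathlib import Path


def params_path(outdir, treebank, filename="params.pickle"):
    """Hypothetical helper (not part of uuparser): absolute path to
    <outdir>/<treebank>/<filename>. The layout is an assumption."""
    return (Path(outdir) / treebank / filename).resolve()


# A bare filename is resolved against the current working directory.
bare = Path("params.pickle").resolve()
print("bare filename resolves to:", bare)

# Spelling out the output directory and treebank gives an unambiguous path.
candidate = params_path("ewt_output", "en_ewt")
print("explicit path:", candidate)
print("exists:", candidate.exists())
```

Printing the resolved path before launching a long training run makes it easy to spot a file that is being looked up in the wrong directory.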

LoicGrobol commented 4 years ago

Hi! Can you paste the command line you used when using the actual path and the traceback for the UnicodeDecodeError?

vlangloisx3 commented 4 years ago

Sure! Here is the command line I used when using the path:

uuparser --outdir ewt_output --datadir ud-treebanks-v2.6 --include "en_ewt" --continue --continueParams=/Users/vlanglois/Desktop/project/ewt_output/en_ewt/params.pickle

and here is the traceback:

Traceback (most recent call last):
  File "/Library/anaconda3/bin/uuparser", line 8, in <module>
    sys.exit(main())
  File "/Library/anaconda3/lib/python3.7/site-packages/uuparser/parser.py", line 279, in main
    run(experiment,options)
  File "/Library/anaconda3/lib/python3.7/site-packages/uuparser/parser.py", line 36, in run
    stored_vocab, stored_options = pickle.load(paramsfp)
  File "/Library/anaconda3/lib/python3.7/codecs.py", line 322, in decode
    (result, consumed) = self._buffer_decode(data, self.errors, final)
UnicodeDecodeError: 'utf-8' codec can't decode byte 0x80 in position 0: invalid start byte
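A UnicodeDecodeError raised from codecs.py inside pickle.load usually means the pickle file was opened in text mode: pickles written with protocol 2 or later start with the byte 0x80, which is not a valid UTF-8 start byte. The sketch below reproduces that symptom and shows that reading in binary mode ("rb") avoids it. The function load_params and the sample data are hypothetical, and the thread does not show what the fix-25 branch actually changed, so treat this as an illustration of the likely cause rather than the real patch.

```python
import os
import pickle
import tempfile


def load_params(path, binary=True):
    """Hypothetical loader (not uuparser's code). binary=False opens the
    file in text mode, which reproduces the error in the traceback."""
    if binary:
        with open(path, "rb") as paramsfp:
            return pickle.load(paramsfp)
    with open(path, "r", encoding="utf-8") as paramsfp:
        return pickle.load(paramsfp)


# Write a small stand-in for params.pickle (made-up contents).
tmpdir = tempfile.mkdtemp()
params_file = os.path.join(tmpdir, "params.pickle")
with open(params_file, "wb") as fp:
    pickle.dump(({"the": 0}, {"epochs": 30}), fp, protocol=2)

# Text mode: the first byte (0x80) cannot be decoded as UTF-8.
try:
    load_params(params_file, binary=False)
    text_mode_error = None
except UnicodeDecodeError as err:
    text_mode_error = err
print("text mode:", text_mode_error)

# Binary mode: the pickle loads as expected.
stored_vocab, stored_options = load_params(params_file)
print("binary mode:", stored_vocab, stored_options)
```

If this is indeed the cause, no choice of path on the command line can work around it, which would explain why the error persisted until the patched version was installed.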

LoicGrobol commented 4 years ago

(Sorry for the latency, I am really low-bandwidth on uuparser right now but I'll try to fix this :-)

Can you try installing from the master branch (using pip install git+https://github.com/UppsalaNLP/uuparser)? There have been some changes since the last release. I suspect they won't make a difference here, but it wouldn't hurt to be sure, and at least we'd have up-to-date line numbers this way.

Actually, I think I found the issue. Can you try installing from the fixup branch using pip install git+https://github.com/UppsalaNLP/uuparser@fix-25 and see if that fixes it?

vlangloisx3 commented 4 years ago

I'm still getting the same error even after installing from the fixup branch. It could be something wrong on my end though.

(Thanks for taking the time to look into it!)

LoicGrobol commented 4 years ago

Oh, if you installed with the exact command I posted in an env where uuparser was already installed, it might not have been updated, because I forgot the -U flag. Could you retry with pip install -U git+https://github.com/UppsalaNLP/uuparser@fix-25? Sorry for the inconvenience.

vlangloisx3 commented 4 years ago

Yes, that solved the issue! Thank you so much!

LoicGrobol commented 4 years ago

Great, I'll merge the fix and push a release before the end of the week, then. Thank you for catching this one :-)

LoicGrobol commented 4 years ago

Closed by #26