Closed MaksymDel closed 6 years ago
Resolved by removing all the model folder, regenerating data and re-running the script from scratch.
By default Marian resumes training when it sees that model folders are not free, right?
Yes. It does. With the example it is still a little bit wacky as the smoothed models (--exponential-smoothing
) should not be the models which are used for resuming, but it does not seem to do harm either. We are currently working on making this fully correct.
BTW, these are the lines counts for files in the data folder:
19122526 data/all.bpe.de
19122526 data/all.bpe.en
4561263 data/corpus.bpe.de
4561263 data/corpus.bpe.en
4590101 data/corpus.de
4590101 data/corpus.en
4561263 data/corpus.tc.de
4561263 data/corpus.tc.en
157788 data/corpus.tok.de
4590101 data/corpus.tok.en
4590101 data/corpus.tok.uncleaned.de
4590101 data/corpus.tok.uncleaned.en
10000000 data/news.2016.bpe.de
10000000 data/news.2016.bpe.en
10000000 data/news.2016.de
10000000 data/news.2016.tc.de
10000000 data/news.2016.tok.de
2737 data/test2014.bpe.en
2737 data/test2014.en
2737 data/test2014.tc.en
2737 data/test2014.tok.en
2169 data/test2015.bpe.en
2169 data/test2015.en
2169 data/test2015.tc.en
2169 data/test2015.tok.en
2999 data/test2016.bpe.en
2999 data/test2016.en
2999 data/test2016.tc.en
2999 data/test2016.tok.en
3004 data/test2017.bpe.en
3004 data/test2017.en
3004 data/test2017.tc.en
3004 data/test2017.tok.en
2999 data/valid.bpe.de
2999 data/valid.bpe.en
2999 data/valid.de
2999 data/valid.en
2999 data/valid.tc.de
2999 data/valid.tc.en
2999 data/valid.tok.de
2999 data/valid.tok.en
Thanks!
Closing for now.
Part of my stdout output:
After that script continues.
I use 16gb GPU to train the model. Any ideas on this?