I am running with a very big file: about 150M lines, disk size 60GB, --num-workers 10, and then :
'vocab += pickle.load(f)' in learn_bpe.py will report error: EOFError: Ran out of input.
tested on windows 10 os. I assume the 'tmp = tempfile.NamedTemporaryFile' introduce this?
anyone has such experience?
I am running with a very big file: about 150M lines, disk size 60GB, --num-workers 10, and then : 'vocab += pickle.load(f)' in learn_bpe.py will report error: EOFError: Ran out of input.
tested on windows 10 os. I assume the 'tmp = tempfile.NamedTemporaryFile' introduce this? anyone has such experience?
thx