Open Gurkiratsinghk opened 3 years ago
I have downloaded and deleted the file 7 times
Assuming you're using the train.py from nsheppard's fork, try running it with --save_every N
where N is the number of steps before it auto-saves (default 1000).
For example: python train.py --dataset data.npz --save_every 10
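To illustrate why a short run can finish without writing a checkpoint, here is a minimal sketch of a "save every N steps" rule. It is not the fork's actual code; the function name save_steps and the modulo check are assumptions made for illustration. Only the default of 1000 comes from the reply above.

```python
def save_steps(total_steps, save_every=1000):
    """Return the steps at which a loop that saves every `save_every`
    steps would write a checkpoint (hypothetical helper, for illustration)."""
    return [step for step in range(1, total_steps + 1) if step % save_every == 0]


# A run stopped after 25 steps with --save_every 10 would have saved twice.
print(save_steps(25, save_every=10))  # [10, 20]

# A run stopped after 500 steps with the default of 1000 never reaches a save step.
print(save_steps(500))  # []
```

Under this assumption, a run that is stopped before step N never reaches a save point, which is why lowering N is worth trying.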
Traceback (most recent call last):
  File "interactive_conditional_samples.py", line 89, in <module>
    fire.Fire(interact_model)
  File "H:\Anaconda\lib\site-packages\fire\core.py", line 141, in Fire
    component_trace = _Fire(component, args, parsed_flag_args, context, name)
  File "H:\Anaconda\lib\site-packages\fire\core.py", line 471, in _Fire
    target=component.__name__)
  File "H:\Anaconda\lib\site-packages\fire\core.py", line 681, in _CallAndUpdateTrace
    component = fn(*varargs, **kwargs)
  File "interactive_conditional_samples.py", line 45, in interact_model
    enc = encoder.get_encoder(model_name)
  File "U:\gpt-2\gpt-2\encoder.py", line 110, in get_encoder
    encoder = json.load(f)
  File "H:\Anaconda\lib\json\__init__.py", line 296, in load
    parse_constant=parse_constant, object_pairs_hook=object_pairs_hook, **kw)
  File "H:\Anaconda\lib\json\__init__.py", line 348, in loads
    return _default_decoder.decode(s)
  File "H:\Anaconda\lib\json\decoder.py", line 337, in decode
    obj, end = self.raw_decode(s, idx=_w(s, 0).end())
  File "H:\Anaconda\lib\json\decoder.py", line 355, in raw_decode
    raise JSONDecodeError("Expecting value", s, err.value) from None
json.decoder.JSONDecodeError: Expecting value: line 1 column 1 (char 0)
A new error has popped up in place of it
Did you change "data.npz" to point to where your dataset is? Or better yet, try running the same train.py command as in the original post and just add --save_every 10 to the end of it.
Actually, I collected all the files in a single folder. When I run the command you are suggesting, it gives an error related to the JSON file, the one I mentioned above.
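"Expecting value: line 1 column 1 (char 0)" is what json.load raises when the file is empty or does not start with valid JSON, for example after an incomplete download. A minimal sketch for checking the file that get_encoder reads follows. The helper name diagnose_json_file is hypothetical, and the path models/117M/encoder.json is an assumption about where the file lives, so adjust it to your folder layout.

```python
import json
import os


def diagnose_json_file(path):
    """Return a short description of why json.load might fail on `path`
    (hypothetical helper, not part of the gpt-2 repository)."""
    if not os.path.exists(path):
        return "missing"
    if os.path.getsize(path) == 0:
        return "empty"
    try:
        with open(path, "r", encoding="utf-8") as f:
            json.load(f)
    except json.JSONDecodeError as err:
        # lineno/colno point at the first character the parser rejected.
        return f"invalid JSON at line {err.lineno} column {err.colno}"
    except UnicodeDecodeError:
        return "not a UTF-8 text file"
    return "ok"


if __name__ == "__main__":
    # Assumed location of the encoder file; change it to match your setup.
    print(diagnose_json_file(os.path.join("models", "117M", "encoder.json")))
```

If this reports "empty" or "invalid JSON at line 1 column 1", the file on disk is probably not the real encoder.json, and downloading it again is the likely fix.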
Following up on what you said, I have a question: should the checkpoint folder contain one or more .ckpt files?
I ran GPT-2's train.py on a .txt training file that contains 3 stories, using the 117M-parameter model. It runs and trains the model, but once it stops it creates a checkpoint folder with a run1 folder inside it, and none of these files are generated:
What should I do?