Closed Marcus-Arcadius closed 2 years ago
You seem to have a double slash in your path, and you seem to be referencing your gdrive
from the colab directory.
Are you mounting your Google Drive correctly with your colab notebook? Here's a stackoverflow answer about the subject.
You seem to have a double slash in your path, and you seem to be referencing your
gdrive
from the colab directory.Are you mounting your Google Drive correctly with your colab notebook? Here's a stackoverflow answer about the subject.
Sorry for the late reply, I honestly thought no one would answer, yes I am mounting my drive to the notebook!
I will give this a try tomorrow morning and hopefully see if it works 😁
@bterrific2008 I managed to get it to finally work, but now I am experiencing these problems?
The error states FileNotFoundError
for the configs/colab_XL.json
file. Can you check if your working directory is in the gpt-neo repository, and if there exists a configs/colab_XL.json
file in your copy of the gpt-neo repo?
The error states
FileNotFoundError
for theconfigs/colab_XL.json
file. Can you check if your working directory is in the gpt-neo repository, and if there exists aconfigs/colab_XL.json
file in your copy of the gpt-neo repo?
I cannot find any file in my copy
The error states
FileNotFoundError
for theconfigs/colab_XL.json
file. Can you check if your working directory is in the gpt-neo repository, and if there exists aconfigs/colab_XL.json
file in your copy of the gpt-neo repo?I cannot find any file in my copy
Try loading configs/gpt3_XL_256_Pile.json
instead.
From my understanding of the notebook, whatever file you write to in the Set Model Configs
is the configuration file you should use for the Training from Scratch
step.
The help text for the --model
parameter of main.py
reads as: JSON file that contains model parameters
, so you should make sure whatever is contained there matches the config file you want to use.
From my understanding of the notebook, whatever file you write to in the
Set Model Configs
is the configuration file you should use for theTraining from Scratch
step.The help text for the
--model
parameter ofmain.py
reads as:JSON file that contains model parameters
, so you should make sure whatever is contained there matches the config file you want to use.
As I could not get it to work, I ended up using the pre-trained model for fine tuning - it worked. If in the future, I want to further train the fine-tuned model, what is the process?