Open nitishajain opened 3 years ago
To better identify the problem, could you tell me what dataset you are running on?
I have encountered the exact same issue with both the WD-singer and FB-15k-237 subsets, which makes me think it's not a dataset-specific issue.
Could you tell me your PyTorch version? I re-downloaded and ran the code without encountering any errors. Using FB15K-237-20% as an example, make sure you run the following commands in order:
unzip data.zip
./experiment.sh configs/fb15k-237-20.sh --process_data <gpu-id>
./experiment-emb.sh configs/fb15k-237-20-conve.sh --train <gpu-id>
./experiment-rs.sh configs/fb15k-237-20-rs.sh --train <gpu-id>
The PyTorch version is 1.7.0. I have tried creating a new environment and running the commands again in the correct order, but I am still getting the same error after training for 3 epochs.
I am sorry, but I have run this code many times and cannot reproduce this error. What is your CUDA version?
The CUDA version is 11.0. Thank you for your efforts. Could you share your versions as well? I can try to reproduce the error in the same environment.
PyTorch: 1.8.1, CUDA: 11.1
It seems that our environments are very similar.
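For anyone comparing environments as in the exchange above, here is a minimal sketch that prints the relevant versions in one go. The helper name `describe_env` is made up for illustration and is not part of this repository. Note that `torch.version.cuda` reports the CUDA version PyTorch was built against, which may differ from the toolkit installed on the system.

```python
import platform


def describe_env():
    """Collect the version details discussed in this thread (hypothetical helper)."""
    info = {"python": platform.python_version()}
    try:
        import torch
    except ImportError:
        # PyTorch is not installed in this environment.
        info["torch"] = None
        return info
    info["torch"] = torch.__version__
    # CUDA version PyTorch was built with; None for CPU-only builds.
    info["cuda_build"] = torch.version.cuda
    # Whether a usable GPU is visible to PyTorch right now.
    info["cuda_available"] = torch.cuda.is_available()
    return info


if __name__ == "__main__":
    for key, value in describe_env().items():
        print(f"{key}: {value}")
```

Pasting this output into the issue alongside the full traceback should make it easier to spot differences between the two setups.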
Hello, I am trying to replicate the steps to train and test the model. After performing the data processing and pretraining of embeddings, I keep encountering the following runtime error when training the model on any dataset:
Any pointers to solve this issue would be most helpful.