Open bolak92 opened 8 months ago
I believe this is a bug for other models as well. I'm running a TextRepresentation + DistMult interaction model and despite having 80G of VRAM PyKEEN still tries to allocate 14.90G more than I have. Conincidentally that's OOM by exactly the same margin as in your example.
Hi @bolak92 ,
could you try whether https://github.com/pykeen/pykeen/pull/1261 has solved your issue? It's not yet in a release but you can use it by installing from source
pip install git+https://github.com/pykeen/pykeen.git
I can confirm that I get this same issue on the latest version, when using apple silicon/"mps" device. i.e consistently crashes on evaluation due to OOM when using "mps" (Macbook Pro M3).
Describe the bug
Unlike the other models, when I train TransE model it fails after few epochs (around 19) with an error
torch.cuda.OutOfMemoryError
This was tested on several GPUs and machines but gives the same result.How to reproduce
Environment
Additional information
No response
Issue Template Checks