Closed ekdnam closed 3 years ago
Hi! How and what version of MXNet did you install? It seems like an issue related to MXNet and the specific GPU available on Google Colab.
Hi! How and what version of MXNet did you install?
I have installed MXNet as mentioned in requirements.gpu-cu110.txt, which is mxnet-cu110==1.8.0.post0
, by executing
pip install sockeye --no-deps -r requirements.gpu-cu110.txt
It seems like an issue related to MXNet and the specific GPU available on Google Colab.
What can be the issue with MXNet?
I, personally, have never faced any problems with GPUs on Google Colab. Can you perhaps give some more information about what the error can be related to GPUs?
Thanks! So MXNet has kernels for different device types built into its binaries. It seems there is a mismatch between what it was built with and what is required by Google Colab. Unfortunately, this is a MXNet issue. Could you open an issue on the MXNet repository as I think MXNet developers would maybe be able to point out a potential way forward. https://github.com/apache/incubator-mxnet
Okay. Thanks for your response.
I have created an issue on the MXNet repo (apache/incubator-mxnet/issues/20469), let's see how it goes.
Thanks! 🤞 I will close the issue here and we can continue on the MXNet issue.
I am currently following this tutorial on Zero-Shot Translation, the notebook (on Google Colab) can be viewed here
In the training step, for some reason, Sockeye is not able to acquire a GPU
The entire output is this (I have to interrupt the execution of the kernel)
How to resolve this?
Note: The files mentioned in the notebook (taken from the aforementioned tutorial) can be viewed here.