Closed SyauqiMA closed 1 year ago
This problem refers to the GoogleColabGPU.ipynb script, sorry for not pointing that out earlier!
Thanks for the detailed bug report! I am sorry our GPU notebook did not work for you.
The issue is that OpenCL is currently broken on Colab, which is what GpuOwl uses. See this post and my response a few posts below for more information: https://mersenneforum.org/showthread.php?p=627996#post627996.
From your provided output, note that my GpuOwl install script has a simple OpenCL check, which is currently failing:
Number of platforms 0
Error: This computer does not have an OpenCL platform
Our GPU notebook was never expecting the install script to fail in this way, so it causes that cascading series of errors that you saw. As a workaround, you could add a simple sed
command to the install()
function in the notebook to disable this OpenCL check:
!sed -i '/^if command -v clinfo/,/^fi/ s/^/# /' gpuowl2.sh # Do not check for clinfo
This would allow the GpuOwl and Prime95/MPrime install scripts to work as expected, but GpuOwl of course will not run until after Colab fixes their OpenCL issue...
In the meantime, I would recommend using our CPU only notebook and/or the old version of our GPU notebook, which ran CUDALucas. CUDALucas is slower than GpuOwl and it only supports LL tests, but it uses CUDA, which currently still works on Colab. Here is a link to that older GPU notebook: https://github.com/tdulcet/Distributed-Computing-Scripts/blob/933d9916a8bc841c3313d77a010d73e95e9dee65/google-colab/GoogleColabGPU.ipynb.
CC: @Danc2050
I just pushed that change to the GPU notebook I suggested above: https://github.com/tdulcet/Distributed-Computing-Scripts/commit/23162e2cb7055a618a909facef2c091a3f0763ba. Let me know if you have any additional questions...
Thanks for the explanation! the broken OpenCL is indeed unfortunate. I will try running the old script, then!
No problem. Yes, we are hoping they will fix OpenCL soon...
Anyway, I just tested the old version of our GPU notebook that I linked to above and it does still work as expected.
I am running the script from my Google Colab, and it seems gpuowl did not successfully installed.
This is the output that I think is the problem
And my colab instance info is this
Is this a problem with my assigned GPU?
The Prime95 installation works prerfectly by the way, and is able to get task from PrimeNet.
But when I restarted my instance and run it again, the output seems to have stuck in the Prime95 menu:
And when I try to stop the Colab cell, it outputs like this:
What is the problem here? I really interested in prime hunting and really appreciate this script! Thanks for your time!