Open tdulcet opened 1 year ago
Is this still actual/useful for merging?
The main change in this PR of fixing the v6 branch on Nvidia GPUs may still be useful, if you or someone with OpenCL experience were able to finish it. However, OpenCL is completely busted with the latest Nvidia driver, so I am unable to test anything to confirm if it fixes the issue. If/When they do fix their driver, they could fix the original issue as well, eliminating the need to make this change to the v6 branch. We are still patiently waiting to see what Nvidia does...
The other minor changes in the PR, notably fixing Clang support and enabling LTO, are still very useful and should be made to the master branch as well. I was planning to make a separate PR after this was merged.
It looks like there are too many unrelated changes in this PR; I'm not inclined to merge it as-is.
Maybe some small individual fixes can be extracted as separate PR.
Yeah, as explained above, this PR is unfinished and does not currently work, which is why it is marked a draft. It has not been a priority to finish either due to OpenCL still being busted with recent Nvidia drivers on Linux.
I could remove the 3d073e09961cedefeb397484f43ab863ac37e824 commit if you were interested in merging the other fixes. Otherwise, I suppose this PR could be closed while we wait for Nvidia...
-h
output can be displayed with the FFT lengths. This currently only supports OpenCL 1.2, so it is not yet able to perform any actual tests of GpuOwl besides this help output.make gpuowl-win.exe
to build GpuOwl.Note that this was my attempt to fix the v6 branch, but it does NOT actually fix it:
I am not an OpenCL programmer, so I obviously did not correctly resolve the merge conflicts. Any help to finish this PR by correctly applying https://github.com/preda/gpuowl/commit/677f43a2ef299f0b8cc9885284fbaa086e917ce2 to the v6 branch would be greatly appreciated by Colab users and likely other people with Nvidia GPUs. Thanks in advance.