Closed mpizzagalli777 closed 1 month ago
Hi @mpizzagalli777 - are you able to run any other CUDA application? Can you try running the CUDA samples from https://github.com/NVIDIA/cuda-samples ? This will help determine whether it's a setup issue.
Sorry about the messaging mess. I ran the CUDA samples and got the following error:
CUDA error at ../../../Common/helper_cuda.h:801 code=999(cudaErrorUnknown) "cudaGetDeviceCount(&device_count)"
Do you happen to know what might be causing this? I tried reinstalling the CUDA toolkit (12.4). Do you know how I could troubleshoot this further?
Hi, posting on the cuda samples GitHub might be helpful! I've also found from personal experience that sometimes restarting after a CUDA driver install/upgrade helps too.
I'm closing this issue for now, since it's not particular to dorado.
Thanks so much for your help. In case someone else has the same issue: I followed the download instructions at https://developer.nvidia.com/cuda-downloads and, after restarting my computer, was able to run the cuda-samples test successfully.
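For anyone who wants to repeat the failing check without building the samples, here is a minimal sketch that calls `cudaGetDeviceCount` (the call that returned code 999 above) through Python's `ctypes`. The function name `probe_cuda` and the returned dictionary keys are made up for illustration. It also assumes the CUDA runtime library (`libcudart`) is discoverable by the system loader, which may not be true for every install; in that case it reports `no_runtime` rather than a CUDA error.

```python
import ctypes
import ctypes.util


def probe_cuda(lib_name="cudart"):
    """Call cudaGetDeviceCount from the CUDA runtime and summarize the result.

    Hypothetical helper for illustration. Returns a dict with the keys
    status, code, count, and message.
    """
    def no_runtime(reason):
        return {"status": "no_runtime", "code": None, "count": None,
                "message": reason}

    # find_library returns None when the loader cannot locate the library.
    path = ctypes.util.find_library(lib_name)
    if path is None:
        return no_runtime(f"could not locate lib{lib_name}")

    try:
        lib = ctypes.CDLL(path)
        get_count = lib.cudaGetDeviceCount
        get_error_string = lib.cudaGetErrorString
    except (OSError, AttributeError) as exc:
        return no_runtime(str(exc))

    # cudaError_t cudaGetDeviceCount(int* count)
    get_count.argtypes = [ctypes.POINTER(ctypes.c_int)]
    get_count.restype = ctypes.c_int
    # const char* cudaGetErrorString(cudaError_t error)
    get_error_string.argtypes = [ctypes.c_int]
    get_error_string.restype = ctypes.c_char_p

    count = ctypes.c_int(0)
    code = get_count(ctypes.byref(count))
    message = get_error_string(code).decode()

    # 0 is cudaSuccess; any other value (such as 999) is an error code.
    if code == 0:
        return {"status": "ok", "code": code, "count": count.value,
                "message": message}
    return {"status": "error", "code": code, "count": None,
            "message": message}


if __name__ == "__main__":
    print(probe_cuda())
```

If this reports `error` with code 999 before a reboot and `ok` afterwards, that matches what I saw with the cuda-samples test.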
Issue Report
Please describe the issue:
Hi, I just installed Dorado on my workstation. I cannot get it to run successfully because of an unknown CUDA error, and I have been unable to find a solution to this issue in other posts. Running nvidia-smi gives the following results. I am aware that we do not have the A100, but I read that the A6000 should work as well (although it might be a little slower).

![NVIDIA-smi](https://github.com/nanoporetech/dorado/assets/154617296/e454286f-a243-4031-aa57-6ce8b28de203)

Here is the output of ldd dorado:

![ldd_dorado](https://github.com/nanoporetech/dorado/assets/154617296/0c5c3bfd-ac50-4a20-b5a1-1b451ef532c7)
Steps to reproduce the issue:
Please list any steps to reproduce the issue.
Run environment:
Please let me know if there is any other information that might be useful.