Open jiaozhentian opened 2 years ago
I tried to use nvml.DeviceGetHandleBySerial instead nvml.DeviceGetHandleByUUID, it works smoothly, have no idea why the function of uuid went wrong sometimes.
@jiaozhentian it may be that @klueska addressed this in https://github.com/NVIDIA/go-nvml/pull/48. Would you be able to try with the latest version?
Hi. This problem still persists. Do you know any workarounds?
I wrote some code to get the status of the GPU, where get handle by uuid. But sometimes the code will get the ERROR_NOT_FOUND (6) of the error, it not always happened. here are my codes:
I try to address it by restart the nvml connection service in codes, but it still get that wrong. However, when the function is over, I do not stop debug, I give it a uuid by gRPC,, DeviceGetHandByUUID can work normally, that is wired. Anyone help me to fix this bugs?