Describe the bug
CUDA_VISIBLE_DEVICES: "1,3,5,7"
works, but
CUDA_VISIBLE_DEVICES: "GPU-0aaaaaaa-74d2-7297-d557-12771b6a79d5,GPU-0bbbbbbb-74d2-7297-d557-12771b6a79d5,GPU-0ccccccc-74d2-7297-d557-12771b6a79d5,GPU-0ddddddd-74d2-7297-d557-12771b6a79d5"
does NOT work, but should work.
The GPU UUIDs (visible with nvidia-smi -L) should be drop-in replacements for numeric IDs.
I prefer to use UUIDs because they are more deterministic (reliable) when environments and cards are moved between systems.
Describe how you solve it
I currently use numeric IDs, but those are unreliable as I move GPUs around often.
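Until UUIDs are accepted directly, one possible stopgap is to keep UUIDs in the configuration and translate them to numeric IDs at launch time. The sketch below is only an illustration, not part of the project: the function names are hypothetical, and it assumes that `nvidia-smi -L` prints lines of the form `GPU 0: <name> (UUID: GPU-...)`. It also sets `CUDA_DEVICE_ORDER=PCI_BUS_ID`, on the assumption that the numeric IDs should follow the same ordering that nvidia-smi reports.

```python
import os
import re
import subprocess
from typing import Dict

# Matches lines such as:
#   GPU 0: NVIDIA A100 (UUID: GPU-0aaaaaaa-74d2-7297-d557-12771b6a79d5)
_GPU_LINE = re.compile(r"^GPU (\d+):.*\(UUID: (GPU-[0-9a-fA-F-]+)\)\s*$")


def parse_gpu_list(listing: str) -> Dict[str, int]:
    """Map each GPU UUID found in `nvidia-smi -L` output to its numeric index."""
    mapping = {}
    for line in listing.splitlines():
        match = _GPU_LINE.match(line.strip())
        if match:
            mapping[match.group(2)] = int(match.group(1))
    return mapping


def uuids_to_indices(uuids: str, listing: str) -> str:
    """Translate a comma-separated UUID list into a comma-separated index list."""
    mapping = parse_gpu_list(listing)
    wanted = [u.strip() for u in uuids.split(",") if u.strip()]
    missing = [u for u in wanted if u not in mapping]
    if missing:
        raise ValueError("GPU UUIDs not present on this host: " + ", ".join(missing))
    return ",".join(str(mapping[u]) for u in wanted)


def apply_to_environment(uuids: str) -> None:
    """Hypothetical helper: query nvidia-smi and export numeric IDs (requires nvidia-smi on PATH)."""
    listing = subprocess.run(
        ["nvidia-smi", "-L"], capture_output=True, text=True, check=True
    ).stdout
    # Ask CUDA to enumerate devices in PCI bus order, matching nvidia-smi.
    os.environ["CUDA_DEVICE_ORDER"] = "PCI_BUS_ID"
    os.environ["CUDA_VISIBLE_DEVICES"] = uuids_to_indices(uuids, listing)
```

Calling `apply_to_environment(...)` with the UUID list before any CUDA library is initialized would set the numeric IDs for the current process. A UUID that is not present on the host raises an error instead of silently selecting the wrong card.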
Environment
Screenshots N/A