Fixes #333, a bug in which `lmql serve` ignored `CUDA_VISIBLE_DEVICES` when resolving its `--layout` argument. I tested this on a machine where I had only devices 0 and 2. Without this fix, setting `--layout 2x1` assigned the lmql server worker processes to devices 0 and 1, which caused problems because someone else was using device 1. With this fix, the workers use the right GPUs (0 and 2).
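
For reviewers, the intended behavior can be sketched roughly as follows. This is a standalone illustration, not the code in this PR: `visible_devices` and `assign_devices` are hypothetical helpers, and reading a layout `AxB` as A worker groups of B devices each is an assumption made for the example.

```python
import os


def visible_devices(env=None):
    """Return the IDs listed in CUDA_VISIBLE_DEVICES, or None if it is unset."""
    env = os.environ if env is None else env
    raw = env.get("CUDA_VISIBLE_DEVICES")
    if raw is None:
        return None
    return [d.strip() for d in raw.split(",") if d.strip()]


def assign_devices(num_groups, devices_per_group, env=None):
    """Hypothetical helper: pick one comma-separated device list per worker group.

    Assumes a layout "AxB" means A groups of B devices each.
    """
    needed = num_groups * devices_per_group
    visible = visible_devices(env)
    if visible is None:
        # No restriction set: fall back to plain indices 0..needed-1.
        visible = [str(i) for i in range(needed)]
    if len(visible) < needed:
        raise ValueError(
            f"layout needs {needed} devices, but only {len(visible)} are visible"
        )
    # Slice the visible devices into consecutive groups.
    return [
        ",".join(visible[g * devices_per_group:(g + 1) * devices_per_group])
        for g in range(num_groups)
    ]


if __name__ == "__main__":
    # Scenario from this PR: only devices 0 and 2 are visible, layout 2x1.
    print(assign_devices(2, 1, env={"CUDA_VISIBLE_DEVICES": "0,2"}))
```

With `CUDA_VISIBLE_DEVICES=0,2`, the sketch hands the two workers devices 0 and 2. With the variable unset, it falls back to indices 0 and 1, which matches the pre-fix behavior described above.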