GPUs differ in memory capacity, but the current implementation doesn't account for this. We should determine the amount of memory available on each GPU, either by detecting it automatically or by accepting it as a flag, and spin up the number of workers that fits in that memory.
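A rough sketch of how the worker count could be derived from per-GPU memory. This is only an illustration, not the project's code: it assumes NVIDIA GPUs with `nvidia-smi` on the PATH, and the function names, the `per_worker_mib` estimate, and the `reserved_mib` headroom are all hypothetical. The memory totals could just as well come from a user-supplied flag instead of the query.

```python
import subprocess


def query_memory_totals_mib() -> str:
    """Ask nvidia-smi for total memory per GPU, one MiB value per line.

    Assumption: NVIDIA GPUs and nvidia-smi installed. Not called at import time.
    """
    result = subprocess.run(
        ["nvidia-smi", "--query-gpu=memory.total", "--format=csv,noheader,nounits"],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


def parse_memory_totals_mib(output: str) -> list[int]:
    """Turn the raw query output into a list of MiB totals, skipping blank lines."""
    return [int(line.strip()) for line in output.splitlines() if line.strip()]


def workers_for_gpu(total_mib: int, per_worker_mib: int, reserved_mib: int = 0) -> int:
    """Number of workers that fit on one GPU after setting aside reserved_mib."""
    if per_worker_mib <= 0:
        raise ValueError("per_worker_mib must be positive")
    usable_mib = max(total_mib - reserved_mib, 0)
    return usable_mib // per_worker_mib


def plan_workers(
    totals_mib: list[int], per_worker_mib: int, reserved_mib: int = 0
) -> dict[int, int]:
    """Map each GPU index to the number of workers it can host."""
    return {
        index: workers_for_gpu(total, per_worker_mib, reserved_mib)
        for index, total in enumerate(totals_mib)
    }


# Example with made-up totals for a 16 GiB and a 24 GiB card.
sample_output = "16384\n24576\n"
plan = plan_workers(
    parse_memory_totals_mib(sample_output), per_worker_mib=6000, reserved_mib=1024
)
print(plan)  # {0: 2, 1: 3}
```

Planning per GPU rather than using a single global count means a mixed machine gets more workers on the larger card. The per-worker memory estimate still has to come from somewhere, for example a measurement of one worker's peak usage.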