Open EntroSanity opened 4 months ago
This pull request updates validateVram function in llm starter script to check for total available vram when user intends to deploy large models across multiple gpus; it comes with a usage example in README
validateVram
README
This pull request updates
validateVram
function in llm starter script to check for total available vram when user intends to deploy large models across multiple gpus; it comes with a usage example inREADME