Closed hodlen closed 10 months ago
Also, I set the default batch size to 32 (instead of 512) to avoid CUDA OOM at the prompt phase to improve server stability.
Also, I set the default batch size to 32 (instead of 512) to avoid CUDA OOM at the prompt phase to improve server stability.