Closed (ggbetz closed 1 month ago)
Check upon issue creation:
Parameters:
NEXT_MODEL_PATH=<org>/<model>
NEXT_MODEL_REVISION=main
NEXT_MODEL_PRECISION=float16
MAX_LENGTH=2048
GPU_MEMORY_UTILIZATION=0.8
VLLM_SWAP_SPACE=4
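For illustration only, here is a minimal sketch of how these parameters might be read from the environment and converted to typed values. The function name `build_engine_kwargs` and the mapping to keyword names (`model`, `revision`, `dtype`, `max_model_len`, `gpu_memory_utilization`, `swap_space`) are assumptions, not something this issue specifies; the defaults simply mirror the values listed above.

```python
import os


def build_engine_kwargs(env=None):
    """Convert the issue's parameters into typed keyword arguments.

    The key names on the left are an assumed mapping (hypothetical),
    chosen to resemble common vLLM engine arguments.
    """
    if env is None:
        env = os.environ
    return {
        # Required: no sensible default for the model path.
        "model": env["NEXT_MODEL_PATH"],
        "revision": env.get("NEXT_MODEL_REVISION", "main"),
        "dtype": env.get("NEXT_MODEL_PRECISION", "float16"),
        # Numeric values arrive as strings and must be converted.
        "max_model_len": int(env.get("MAX_LENGTH", "2048")),
        "gpu_memory_utilization": float(env.get("GPU_MEMORY_UTILIZATION", "0.8")),
        "swap_space": int(env.get("VLLM_SWAP_SPACE", "4")),
    }


if __name__ == "__main__":
    # Example with the placeholder model path left unfilled, as in the issue.
    example = {"NEXT_MODEL_PATH": "<org>/<model>", "MAX_LENGTH": "2048"}
    print(build_engine_kwargs(example))
```

If the evaluation pipeline uses vLLM, a dictionary like this could plausibly be passed on as engine arguments, but how the actual workflow consumes these variables is not stated here.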
ToDos: