Closed ggbetz closed 6 months ago
Check upon issue creation:
Parameters:
With XXX in
NEXT_MODEL_PATH=meta-llama/Meta-Llama-3-XXX NEXT_MODEL_REVISION=main NEXT_MODEL_PRECISION=bfloat16 MAX_LENGTH=2048 GPU_MEMORY_UTILIZATION=0.8 VLLM_SWAP_SPACE=12
ToDos:
Check upon issue creation:
Parameters:
With XXX in
ToDos: