not enough swap space issue

logikon-ai / cot-eval

A framework for evaluating the effectiveness of chain-of-thought reasoning in language models.

MIT License

5 stars 1 forks source link

Closed ggbetz closed 3 months ago

ggbetz commented 4 months ago

when evaluating microsoft/orca-7b

ggbetz commented 4 months ago

There's an env variable to fix that, so we have to add, e.g.

VLLM_SWAP_SPACE=6

To the configuration (e.g., config.env)

ggbetz commented 4 months ago

The problem seems to arise when traces are generated with beam search. For clarification: https://github.com/vllm-project/vllm/issues/2853

ggbetz commented 3 months ago

This has been resolved.

Pipeline ran successfully for the NousResearch/Nous-Hermes-llama-2-7b model with latest docker container.

Increasing the VLLM_SWAP_SPACE, i.e., the CPU memory vllm may use for offloading during beam search, resolves this issue.

Details:

On slurm, I had to make sure to grab not only GPUs, but also enough memory allocated to the job via ntasksparameter. Before, I got OOM (not cuda oom) error.
I used VLLM_SWAP_SPACE=32