logikon-ai / cot-eval

A framework for evaluating the effectiveness of chain-of-thought reasoning in language models.
https://huggingface.co/spaces/logikon/open_cot_leaderboard
MIT License
5 stars 1 forks source link

not enough swap space issue #10

Closed ggbetz closed 3 months ago

ggbetz commented 4 months ago

when evaluating microsoft/orca-7b

ggbetz commented 4 months ago

see also https://github.com/vllm-project/vllm/issues/787

There's an env variable to fix that, so we have to add, e.g.

VLLM_SWAP_SPACE=6

To the configuration (e.g., config.env)

ggbetz commented 4 months ago

The problem seems to arise when traces are generated with beam search. For clarification: https://github.com/vllm-project/vllm/issues/2853

ggbetz commented 3 months ago

This has been resolved.

Pipeline ran successfully for the NousResearch/Nous-Hermes-llama-2-7b model with latest docker container.

Increasing the VLLM_SWAP_SPACE, i.e., the CPU memory vllm may use for offloading during beam search, resolves this issue.

Details: