CoeusMaze closed this issue 4 months ago.
This is right. vLLM's version management is not handled very well, which leads to a lot of conflicts. I suggest:
pip install vllm
pip uninstall flash_attn xgboost transformer_engine -y
related issue: https://github.com/vllm-project/vllm/pull/2804
The current batch_inference.py requires the vllm package, but vllm is not listed in requirements.txt and can conflict with the flash_attn package. Is there a way around this other than commenting out the vllm import every time we want to do batch inference with a normal LLM?
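One possible workaround (not from this thread, just a sketch) is to import vllm lazily inside batch_inference.py and fall back to a plain transformers path when vllm is not installed, so nothing has to be commented out. The function name `run_batch_inference` and the `use_vllm` flag below are hypothetical and not part of the actual script:

```python
# Sketch: make the vllm import optional so the script still runs without it.
try:
    from vllm import LLM, SamplingParams  # only needed for the vLLM-backed path
    VLLM_AVAILABLE = True
except ImportError:
    LLM, SamplingParams = None, None
    VLLM_AVAILABLE = False


def run_batch_inference(prompts, model_name, use_vllm=False):
    """Run batch inference, using vLLM only when requested and available."""
    if use_vllm:
        if not VLLM_AVAILABLE:
            raise RuntimeError("vllm is not installed; run `pip install vllm` to use this path")
        llm = LLM(model=model_name)
        outputs = llm.generate(prompts, SamplingParams(max_tokens=256))
        return [o.outputs[0].text for o in outputs]

    # Fallback: ordinary transformers generation, no vllm import required.
    from transformers import pipeline
    generator = pipeline("text-generation", model=model_name)
    return [r[0]["generated_text"] for r in generator(prompts, max_new_tokens=256)]
```

With a guard like this, vllm can stay out of requirements.txt and only be installed by users who actually want the vLLM path.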