Closed ByteCaprice closed 5 months ago
前因后果讲清楚,没明白你这个是运行什么导致的。
前因后果讲清楚,没明白你这个是运行什么导致的。 用vllm下的inference_hf.py文件进行推理时产生的,显存快满的时候会不停的警告
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your consideration.
Closing the issue, since no updates observed. Feel free to re-open if you need any further assistance.
提交前必须检查以下项目
问题类型
效果问题
基础模型
Chinese-LLaMA-2 (7B/13B)
操作系统
Linux
详细描述问题
依赖情况(代码类问题务必提供)
运行日志或截图