Open Leo20100307 opened 2 weeks ago
请问日志中有error相关字样的log么? 如有则可以进到outputs相对应的logs文件夹中查看对应的error明细 / Please check the error log file in the outputs directory and get details of err msg.
另外请check一下,评测相关的data是否有预先准备: 参考 https://evalscope.readthedocs.io/zh-cn/latest/user_guides/backend/opencompass_backend.html
请问日志中有error相关字样的log么? 如有则可以进到outputs相对应的logs文件夹中查看对应的error明细 / Please check the error log file in the outputs directory and get details of err msg.
outputs目录下,有个txt文档,里面没有看到报错。日志文件80M,无法上传。
vllm端有打印,模型应该是有接收到请求并做了处理:
另外请check一下,评测相关的data是否有预先准备: 参考 https://evalscope.readthedocs.io/zh-cn/latest/user_guides/backend/opencompass_backend.html
数据文件已经下载,,并解压到当前目录下,目录名称"data"
data目录下的数据集文件
问题描述 / Issue Description
请简要描述您遇到的问题。 / Please briefly describe the issue you encountered.
本地/root/ChatGLM目录下载的ChatGLM2-6B模型,
使用vllm部署server:
vllm serve /root/ChatGLM --chat-template ./examples/template_chatglm2.jinja --trust_remote_code --use-v2-block-manager
evalscope相关配置:
(evalscope) root@ubuntu:~/evalscope# cat eval_openai_api.yaml eval_backend: OpenCompass eval_config: datasets:
temperature: 0.0
(evalscope) root@ubuntu:~/evalscope# cat example_eval_openai_api.py from evalscope.run import run_task from evalscope.summarizer import Summarizer
def run_eval():
Option 1: Python dictionary
run_eval()
使用的工具 / Tools Used
执行的代码或指令 / Code or Commands Executed
请提供您执行的主要代码或指令。 / Please provide the main code or commands you executed. 例如 / For example:
执行测试: python example_eval_openai_api.py
错误日志 / Error Log
请粘贴完整的错误日志或控制台输出。 / Please paste the full error log or console output. 例如 / For example:
dataset version metric mode /root/ChatGLM
--------- 考试 Exam --------- - - - - ceval - - - - cmb - - - - agieval - - - - mmlu - - - - GaokaoBench - - - - ARC-c - - - - ARC-e - - - - --------- 语言 Language --------- - - - - WiC - - - - summedits - - - - chid-dev - - - - afqmc-dev - - - - bustm-dev - - - - cluewsc-dev - - - - WSC - - - - winogrande - - - - flores_100 - - - - --------- 知识 Knowledge --------- - - - - BoolQ - - - - commonsense_qa - - - - nq - - - - triviaqa - - - - --------- 推理 Reasoning --------- - - - - cmnli - - - - ocnli - - - - ocnli_fc-dev - - - - AX_b - - - - AX_g - - - - CB - - - - RTE - - - - story_cloze - - - - COPA - - - - ReCoRD - - - - hellaswag - - - - piqa - - - - siqa - - - - strategyqa - - - - math - - - - gsm8k - - - - TheoremQA - - - - openai_humaneval - - - - mbpp - - - - bbh - - - - --------- 理解 Understanding --------- - - - - C3 - - - - CMRC_dev - - - - DRCD_dev - - - - MultiRC - - - - race-middle - - - - race-high - - - - openbookqa_fact - - - - csl_dev - - - - lcsts - - - - Xsum - - - - eprstmt-dev - - - - lambada - - - - tnews-dev - - - - 11/07 07:06:42 - OpenCompass - INFO - write summary to /root/evalscope/outputs/default/20241107_070629/summary/summary_20241107_070629.txt 11/07 07:06:42 - OpenCompass - INFO - write csv to /root/evalscope/outputs/default/20241107_070629/summary/summary_20241107_070629.csv
运行环境 / Runtime Environment
操作系统 / Operating System:
Python版本 / Python Version:
其他信息 / Additional Information
如果有其他相关信息,请在此处提供。 / If there is any other relevant information, please provide it here.