open-compass / opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.
https://opencompass.org.cn/
Apache License 2.0

[Feature] ModuleNotFoundError: No module named 'opencompass.datasets.lawbench.evaluation_functions' #736

Open plutoda588 opened 7 months ago

plutoda588 commented 7 months ago

Describe the feature

I ran the following command:

python run.py --datasets ceval_ppl mmlu_ppl --hf-path /T106/LLM_model/llama-7b --model-kwargs device_map='auto' --tokenizer-kwargs padding_side='left' truncation='left' use_fast=False --max-out-len 100 --max-seq-len 2048 --batch-size 8 --no-batch-padding --num-gpus 1

...................................
98%|████████████████████████████████████████████████████████████████████████████████████████████▎ | 107/109 [04:13<00:01, 1.99it/s]
12/25 12:59:23 - OpenCompass - WARNING - task OpenICLEval[opencompass.models.huggingface.HuggingFace_LLM_model_llama-7b/lukaemon_mmlu_conceptual_physics] fail, see ./outputs/default/20231225_125408/logs/eval/opencompass.models.huggingface.HuggingFace_LLM_model_llama-7b/lukaemon_mmlu_conceptual_physics.out
12/25 12:59:23 - OpenCompass - WARNING - task OpenICLEval[opencompass.models.huggingface.HuggingFace_LLM_model_llama-7b/lukaemon_mmlu_us_foreign_policy] fail, see ./outputs/default/20231225_125408/logs/eval/opencompass.models.huggingface.HuggingFace_LLM_model_llama-7b/lukaemon_mmlu_us_foreign_policy.out
100%|██████████████████████████████████████████████████████████████████████████████████████████████| 109/109 [04:13<00:00, 2.33s/it]
12/25 12:59:23 - OpenCompass - ERROR - /T106/model_main/opencompass-main/opencompass-main/opencompass/runners/base.py - summarize - 63 - OpenICLEval[opencompass.models.huggingface.HuggingFace_LLM_model_llama-7b/ceval-computer_network] failed with code 1
12/25 12:59:23 - OpenCompass - ERROR - /T106/model_main/opencompass-main/opencompass-main/opencompass/runners/base.py - summarize - 63 - OpenICLEval[opencompass.models.huggingface.HuggingFace_LLM_model_llama-7b/ceval-operating_system] failed with code 1
................... ................... ...................
12/25 12:59:23 - OpenCompass - ERROR - /T106/model_main/opencompass-main/opencompass-main/opencompass/runners/base.py - summarize - 63 - OpenICLEval[opencompass.models.huggingface.HuggingFace_LLM_model_llama-7b/lukaemon_mmlu_conceptual_physics] failed with code 1 dataset

The per-task log ends with this traceback:

...............................
  File "<frozen importlib._bootstrap>", line 241, in _call_with_frames_removed
  File "/opt/conda/envs/mixtralkit/lib/python3.10/site-packages/opencompass/datasets/__init__.py", line 51, in <module>
    from .lawbench import * # noqa: F401, F403
  File "/opt/conda/envs/mixtralkit/lib/python3.10/site-packages/opencompass/datasets/lawbench/__init__.py", line 1, in <module>
    from .lawbench import LawBenchDataset # noqa: F401
  File "/opt/conda/envs/mixtralkit/lib/python3.10/site-packages/opencompass/datasets/lawbench/lawbench.py", line 10, in <module>
    from .evaluation_functions import (cjft, flzx, ftcs, jdzy, jec_ac, jec_kd,
ModuleNotFoundError: No module named 'opencompass.datasets.lawbench.evaluation_functions'
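The traceback points at the copy of opencompass under site-packages, where lawbench.py fails on `from .evaluation_functions import ...`. One thing worth checking is whether that subpackage directory actually exists in the installed copy. The sketch below does this by inspecting the filesystem only, so it does not trigger the failing import itself. The helper name `subpackage_status` and its return strings are hypothetical, invented for illustration; they are not part of OpenCompass.

```python
import importlib.util
from pathlib import Path


def subpackage_status(top_level: str, relative_path: str) -> str:
    """Check whether a subpackage directory exists in an installed package.

    Hypothetical helper for illustration. Only the top-level spec is looked
    up, so nothing inside the package is imported.

    Returns one of:
      "not-found" - top-level name is not an importable package
      "missing"   - the subpackage directory does not exist
      "no-init"   - the directory exists but has no __init__.py
      "present"   - the directory exists and contains __init__.py
    """
    spec = importlib.util.find_spec(top_level)
    if spec is None or not spec.submodule_search_locations:
        return "not-found"
    for root in spec.submodule_search_locations:
        candidate = Path(root) / relative_path
        if candidate.is_dir():
            if (candidate / "__init__.py").is_file():
                return "present"
            return "no-init"
    return "missing"


if __name__ == "__main__":
    # Path taken from the traceback above.
    print(subpackage_status("opencompass", "datasets/lawbench/evaluation_functions"))
```

If this reports "missing" or "no-init" for the site-packages copy while the directory is present in the source checkout, the installed package is probably incomplete. In that case, running from the source tree or reinstalling from it (for example with `pip install -e .`) may be worth trying; this is a guess based on the traceback, not a confirmed fix.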

Will you implement it?

yoonlee888 commented 6 months ago

I ran into the same problem when running mixtral-8x7b-v0.1. When do you expect this to be supported?

mdjhacker commented 3 months ago

same