OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
srun: job 9603110 queued and waiting for resources
srun: job 9603110 has been allocated resources
srun: Job 9603110 scheduled successfully!
Current QUOTA_TYPE is [reserved], which means the job has occupied quota in RESERVED_TOTAL under your partition.
Current PHX_PRIORITY is normal
/mnt/petrelfs/gaohongzhi/anaconda3/envs/xtuner/lib/python3.10/site-packages/colossalai/kernel/cuda_native/mha/flash_attn_2.py:28: UserWarning: please install flash_attn from https://github.com/HazyResearch/flash-attention
warnings.warn("please install flash_attn from https://github.com/HazyResearch/flash-attention")
/mnt/petrelfs/gaohongzhi/anaconda3/envs/xtuner/lib/python3.10/site-packages/colossalai/kernel/cuda_native/mha/mem_eff_attn.py:15: UserWarning: please install xformers from https://github.com/facebookresearch/xformers
warnings.warn("please install xformers from https://github.com/facebookresearch/xformers")
[2023-10-30 14:30:36,299] [INFO] [real_accelerator.py:158:get_accelerator] Setting ds_accelerator to cuda (auto detect)
10/30 14:30:40 - OpenCompass - INFO - Task [opencompass.models.huggingface.HuggingFace_models_chatglm2-6b/tyidqa-goldp_japanese,opencompass.models.huggingface.HuggingFace_models_chatglm2-6b/tyidqa-goldp_english,opencompass.models.huggingface.HuggingFace_models_chatglm2-6b/tyidqa-goldp_korean,opencompass.models.huggingface.HuggingFace_models_chatglm2-6b/tyidqa-goldp_bengali]
Loading checkpoint shards: 0%| | 0/7 [00:00<?, ?it/s]
Loading checkpoint shards: 14%|█▍ | 1/7 [00:03<00:22, 3.75s/it]
Loading checkpoint shards: 29%|██▊ | 2/7 [00:07<00:19, 3.92s/it]
Loading checkpoint shards: 43%|████▎ | 3/7 [00:11<00:15, 3.93s/it]
Loading checkpoint shards: 57%|█████▋ | 4/7 [00:15<00:11, 3.87s/it](null): _log_init: Unable to open logfile `/var/log/slurm/slurmd.log': No such file or directory
Loading checkpoint shards: 71%|███████▏ | 5/7 [00:19<00:08, 4.02s/it]
Loading checkpoint shards: 86%|████████▌ | 6/7 [00:23<00:04, 4.02s/it]
Loading checkpoint shards: 100%|██████████| 7/7 [00:26<00:00, 3.43s/it]
Loading checkpoint shards: 100%|██████████| 7/7 [00:26<00:00, 3.72s/it]
10/30 14:31:20 - OpenCompass - INFO - Start inferencing [opencompass.models.huggingface.HuggingFace_models_chatglm2-6b/tyidqa-goldp_japanese]
[2023-10-30 14:31:21,082] [opencompass.openicl.icl_inferencer.icl_gen_inferencer] [INFO] Starting inference process...
0%| | 0/4 [00:00<?, ?it/s](null): _log_init: Unable to open logfile `/var/log/slurm/slurmd.log': No such file or directory
(null): _log_init: Unable to open logfile `/var/log/slurm/slurmd.log': No such file or directory
(null): _log_init: Unable to open logfile `/var/log/slurm/slurmd.log': No such file or directory
(null): _log_init: Unable to open logfile `/var/log/slurm/slurmd.log': No such file or directory
(null): _log_init: Unable to open logfile `/var/log/slurm/slurmd.log': No such file or directory
(null): _log_init: Unable to open logfile `/var/log/slurm/slurmd.log': No such file or directory
(null): _log_init: Unable to open logfile `/var/log/slurm/slurmd.log': No such file or directory
(null): _log_init: Unable to open logfile `/var/log/slurm/slurmd.log': No such file or directory
(null): _log_init: Unable to open logfile `/var/log/slurm/slurmd.log': No such file or directory
(null): _log_init: Unable to open logfile `/var/log/slurm/slurmd.log': No such file or directory
(null): _log_init: Unable to open logfile `/var/log/slurm/slurmd.log': No such file or directory
(null): _log_init: Unable to open logfile `/var/log/slurm/slurmd.log': No such file or directory
(null): _log_init: Unable to open logfile `/var/log/slurm/slurmd.log': No such file or directory
(null): _log_init: Unable to open logfile `/var/log/slurm/slurmd.log': No such file or directory
(null): _log_init: Unable to open logfile `/var/log/slurm/slurmd.log': No such file or directory
(null): _log_init: Unable to open logfile `/var/log/slurm/slurmd.log': No such file or directory
(null): _log_init: Unable to open logfile `/var/log/slurm/slurmd.log': No such file or directory
(null): _log_init: Unable to open logfile `/var/log/slurm/slurmd.log': No such file or directory
(null): _log_init: Unable to open logfile `/var/log/slurm/slurmd.log': No such file or directory
(null): _log_init: Unable to open logfile `/var/log/slurm/slurmd.log': No such file or directory
(null): _log_init: Unable to open logfile `/var/log/slurm/slurmd.log': No such file or directory
(null): _log_init: Unable to open logfile `/var/log/slurm/slurmd.log': No such file or directory
(null): _log_init: Unable to open logfile `/var/log/slurm/slurmd.log': No such file or directory
(null): _log_init: Unable to open logfile `/var/log/slurm/slurmd.log': No such file or directory
(null): _log_init: Unable to open logfile `/var/log/slurm/slurmd.log': No such file or directory
(null): _log_init: Unable to open logfile `/var/log/slurm/slurmd.log': No such file or directory
其他信息
I don't have the permission to edit /var/log/slurm/slurmd.log file.
先决条件
问题类型
我正在使用官方支持的任务/模型/数据集进行评估。
环境
重现问题 - 代码/配置示例
重现问题 - 命令或脚本
bash
重现问题 - 错误信息
其他信息
I don't have the permission to edit
/var/log/slurm/slurmd.log
file.