Has the official team evaluated Qwen2.5-Coder-7B-Instruct on any FIM-related datasets?

QwenLM / Qwen2.5-Coder

Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud.


Closed: kartikzheng closed this issue 3 weeks ago

kartikzheng commented 1 month ago

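I evaluated Qwen2.5-Coder-7B-Instruct on the SantaCoder FIM dataset and found that it scores about 10 percentage points lower than on the HumanEval dataset. The gap is most visible for Python, where pass@1 is only around 50%. Other code LLMs, by comparison, show no noticeable drop in pass@1.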

cyente commented 1 month ago

Could you please provide us with the specific evaluation script and the prompts that have been tested? We will check it out.
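For context, here is a minimal sketch of how a FIM prompt for this kind of evaluation might be assembled. It assumes the prefix-suffix-middle layout and the special tokens `<|fim_prefix|>`, `<|fim_suffix|>`, and `<|fim_middle|>` shown in the Qwen2.5-Coder README. The helper names `build_fim_prompt` and `splice_completion` are hypothetical, and this is not the script used in the report above.

```python
# Special tokens as documented in the Qwen2.5-Coder README (assumed to
# apply to the checkpoint being evaluated).
FIM_PREFIX = "<|fim_prefix|>"
FIM_SUFFIX = "<|fim_suffix|>"
FIM_MIDDLE = "<|fim_middle|>"


def build_fim_prompt(prefix: str, suffix: str) -> str:
    """Assemble a prefix-suffix-middle prompt (hypothetical helper).

    The model is expected to generate the missing middle after FIM_MIDDLE.
    """
    return f"{FIM_PREFIX}{prefix}{FIM_SUFFIX}{suffix}{FIM_MIDDLE}"


def splice_completion(prefix: str, middle: str, suffix: str) -> str:
    """Rebuild the full program so it can be run against unit tests."""
    return prefix + middle + suffix


# Illustrative sample only; not taken from the SantaCoder FIM dataset.
prefix = "def add(a, b):\n    "
suffix = "\n\nprint(add(1, 2))\n"
prompt = build_fim_prompt(prefix, suffix)
program = splice_completion(prefix, "return a + b", suffix)
```

Sharing the equivalent of `prompt` for a few failing samples, along with the generation settings, would make the reported gap easier to reproduce.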

cyente commented 3 weeks ago

Closing due to no reply.