评估中每一列分别代表什么？

eosphoros-ai / DB-GPT-Hub

A repository that contains models, datasets, and fine-tuning techniques for DB-GPT, with the purpose of enhancing model performance in Text-to-SQL

MIT License

1.21k stars 169 forks source link

评估中每一列分别代表什么？ #265

Open tongcu opened 2 months ago

tongcu commented 2 months ago

请问每一列分别代表什么？我看到写的overall是0.789但是首页标注着一列是median CodeLlama-13b-Instruct-hf_lora 0.789 sft train by our this project,only used spider train dataset, the same eval way in this project with lora SFT. The weights has pulished .

CodeLlama-13B-Instruct base 0.698 0.601 0.408 0.271 0.539 lora 0.94 0.789 0.684 0.404 0.746 qlora 0.94 0.774 0.626 0.392 0.727

Kudou-Chitose commented 1 month ago

对应Method，Easy，Medium，Hard，Extra，All。不同难度的任务，不同的得分。参考https://yale-lily.github.io/spider 页面下的Data Examples。