xingyaoww / code-act

Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhang, Yunzhu Li, Hao Peng, Heng Ji.
MIT License
468 stars 38 forks source link

Inquiry regarding to Mistral model results #10

Open xingjianleng opened 3 months ago

xingjianleng commented 3 months ago

Thanks for your great work!

I am interested in the CodeAct framework and would like to reproduce Table 2 in the original paper, the results of atomic API call correctness on API-Bank.

I saw that experiments were conducted on Mistral-7B-Instruct-v0.1. I followed the instructions for API-Bank evaluation preparation in the repo. However, as far as I know, all LLMs developed by Mistral do not support system prompts. Directly running the run.sh evaluation script would lead to an error, stating the incorrect formatting of the messages.

I am wondering how to obtain values in the table for Mistral-7B-Instruct-v0.1.