Open HyunggyuJang opened 6 months ago
Hi, thanks for the question. The preprocess script is for the v1 version data, to process with v2 data (which contains both content and function_call), you can modify the script to add the content
. We will soon update the preprocess script to process with v2 data.
Thanks for your reply. Can I ask one more question? How could you manage to get a response from OpenAI model that contains both the content and function_call? To my knowledge, if the model decided to call function, the content part always null.
Hi @pooruss Could you please release the preprocessing script for the v2 data? thanks!
Specifically, to convert OpenAI's function calling syntax to Llama compatible form, preprocessing is executed.
In the code, if the assistant message contains
function_call
property, it discards thecontent
part, which is the part ofThought
in Llama compatible syntax: https://github.com/OpenBMB/ToolBench/blob/3010778f5834fde71dc7658b4d51d1023affc21d/preprocess/preprocess_toolllama_data.py#L65However, in toolllama_G123_dfs_train.json, the assistant message that contains both
content
andfunction_call
, it convered toThought: ${content}\nAction: ...
, without discarding the content part.