Closed leocnj closed 1 day ago
Hi @leocnj, thanks for the issue. We are investigating.
Hi @leocnj,
This is not a false alarm.
For the Java and JavaScript categories, we do some pre-processing on the prompt and function document before querying the model. Specifically, at the end of the prompt, we explicitly state that the provided function is in Java 8/JavaScript syntax. We also change the parameter type to String (since String is JSON compatible) and add to the parameter description: "This is Java/Javascript" + {original_type} + " in string representation."
So in your example, the model should provide a JavaScript List in its string representation, e.g. '["completed", "failed"]'.
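For readers following along, here is a rough sketch of the pre-processing described above. It is not the actual BFCL code: the helper names, the document layout, the example function, and the exact wording of the appended notes are all assumptions made for illustration.

```python
import copy


def stringify_param_types(function_doc, language):
    """Return a copy of function_doc with every parameter typed as String.

    Hypothetical helper: the name and the exact note wording are assumptions.
    """
    doc = copy.deepcopy(function_doc)  # leave the caller's document untouched
    for spec in doc["parameters"]["properties"].values():
        original_type = spec["type"]
        spec["type"] = "String"  # String is JSON compatible
        note = f"This is {language} {original_type} in string representation."
        spec["description"] = (spec.get("description", "") + " " + note).strip()
    return doc


def add_language_hint(prompt, language):
    """Append a syntax hint to the end of the prompt (wording is an assumption)."""
    return f"{prompt} Note that the provided function is in {language} syntax."


# Hypothetical function document with an array-typed "status" parameter.
example_doc = {
    "name": "filter_tasks",
    "parameters": {
        "type": "dict",
        "properties": {
            "status": {"type": "array", "description": "Statuses to keep."},
        },
    },
}

converted = stringify_param_types(example_doc, "JavaScript")
print(converted["parameters"]["properties"]["status"])
```

After this step the declared type of status is String, so the expected model output is the string '["completed", "failed"]' rather than a native list.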
@HuanzhiMao, thanks very much for your prompt answer.
For the Hammer model, I found the piece of code you referred to. Given this change to the function spec, I agree that the error reported for the model's output (a list) is not a false alarm.
When using the latest BFCL v3 to evaluate the MadeAgents_Hammer2.0-1.5b model, I found that the model fails on the javascript_2 sample. However, when checking the details, it looks like this error is a false alarm: the status argument needs to be an array containing String items, but when checking the status argument's type, the eval code seems to incorrectly think the type needs to be a String.
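To illustrate the mismatch reported here, the following is a minimal sketch of a type check that would behave this way. It assumes the declared type of status has been rewritten to String, as the maintainer's reply in this thread describes. The function name and messages are hypothetical and not taken from the BFCL checker.

```python
import json


def check_string_param(value):
    """Return (ok, message) for a parameter whose declared type is String.

    Hypothetical check, not the actual BFCL evaluation code.
    """
    if not isinstance(value, str):
        return False, f"expected str, got {type(value).__name__}"
    return True, "ok"


list_output = ["completed", "failed"]      # native list, as the model produced
string_output = '["completed", "failed"]'  # the list in string representation

print(check_string_param(list_output))    # (False, 'expected str, got list')
print(check_string_param(string_output))  # (True, 'ok')

# The string form still carries the same items once parsed as JSON.
assert json.loads(string_output) == list_output
```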