Question about AST evaluation for Java and JavaScript

Hi @lucenzhong, GPT models in FC mode don't need to have their result parsed for Java and JavaScript categories, because the model output is in JSON format and all the parameters involved with the Java/JS test are of type string, which is JSON compatible. So here, json.load is enough to get the result ready to be fed into the evaluation pipeline. The actual parsing part (to turn parameter value from string into their 'real' type) for these parameters happens in the checker section(here), which involves calling the Java type converter and the JS type converter. These two converters both make use of the tree-sitter. Let me know if you have more questions!

ps, this is not related to your question, but these two lines in the code section you referenced are not necessary and should not exist (sorry I just noticed). They will introduce false positives to the result because they are double type-casting there (namely, they are giving the model a 'second chance' when their parameter type is wrong). This issue will affect quite a few models, not just the OpenAI family. We will roll out a fix for that soon.

ShishirPatil / gorilla

Question about AST evaluation for Java and JavaScript #477