Open zthang opened 10 months ago
Same request here! I cannot reproduce the results in the paper when evaluating by checking whether the lowercased output is contained in the lowercased gold answer.
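For reference, the check described above can be sketched roughly as follows. This is only an assumption about what such an evaluation might look like, not the authors' code; the function name `contains_match` and the empty-output guard are hypothetical additions.

```python
def contains_match(output: str, gold: str) -> bool:
    """Return True if the lowercased output is a substring of the lowercased gold answer."""
    out = output.strip().lower()
    ref = gold.strip().lower()
    # Assumption: an empty output counts as a miss. Without this guard,
    # "" would be a substring of every gold answer and always score as correct.
    if not out:
        return False
    return out in ref


def accuracy(outputs: list[str], golds: list[str]) -> float:
    """Fraction of examples where the containment check succeeds."""
    if not outputs:
        return 0.0
    hits = sum(contains_match(o, g) for o, g in zip(outputs, golds))
    return hits / len(outputs)
```

Note that the direction matters: checking whether the output is in the gold answer gives different numbers from checking whether the gold answer is in the output, so a mismatch here could explain part of a reproduction gap.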
Line 131 of prep.py, "examples = self_consistency(examples, anchor_text=anchor_text)", raises an error saying that the self_consistency function is not defined. Is the code for this function publicly available? I can't find it in any of the files.
Good work! Could you provide the evaluation code for the results presented in the paper (exact match, F1, recall, and precision)? Thanks!
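Until the official code is available, here is a minimal sketch of how these metrics are commonly computed, assuming SQuAD-style answer normalization and token-level overlap. This is not the authors' evaluation code, and the paper's numbers may rely on a different normalization; the names `normalize`, `exact_match`, and `token_prf` are hypothetical.

```python
import re
import string
from collections import Counter


def normalize(text: str) -> str:
    """Lowercase, strip punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(pred: str, gold: str) -> bool:
    """True if prediction and gold answer are identical after normalization."""
    return normalize(pred) == normalize(gold)


def token_prf(pred: str, gold: str) -> tuple[float, float, float]:
    """Token-level (precision, recall, F1) between prediction and gold answer."""
    pred_tokens = normalize(pred).split()
    gold_tokens = normalize(gold).split()
    # Multiset intersection counts each shared token at most as often
    # as it appears in both sequences.
    overlap = sum((Counter(pred_tokens) & Counter(gold_tokens)).values())
    if overlap == 0:
        return 0.0, 0.0, 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(gold_tokens)
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1
```

Per-example scores would then be averaged over the dataset; whether the paper averages this way, or takes a maximum over multiple gold answers, is something only the authors can confirm.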