Open dfsbora opened 4 weeks ago
Evaluate valid formula, check for labels/refs/formulas in original text, etc
EvaluatePrompt class' run_all method (in utils.py) returns a dictionary with the metrics:
Still want to improve the formula comparison to account for slight modifications, rather than a binary check.
Evaluate valid formula, check for labels/refs/formulas in original text, etc