defog-ai / sql-eval

Evaluate the accuracy of LLM-generated outputs
Apache License 2.0
540 stars, 56 forks

Modify result plots in Slack #188

Closed: wendy-aw closed this 3 months ago

wendy-aw commented 3 months ago

Modified the plotting code to facet by CoT (chain-of-thought) inference and to allow graphs of different models to be overlaid on each other.
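For readers unfamiliar with the layout described here, below is a minimal sketch of faceting by a CoT flag and overlaying models on shared axes. It is not the code from this PR: the row fields (`model`, `cot`, `category`, `accuracy`), the sample values, and the helpers `facet_results` and `plot_facets` are all hypothetical, and matplotlib is assumed to be available only if the plot is actually drawn.

```python
from collections import defaultdict

# Hypothetical result rows; field names and values are illustrative assumptions,
# not the schema or numbers used by the repository.
RESULTS = [
    {"model": "model-a", "cot": False, "category": "date_functions", "accuracy": 0.70},
    {"model": "model-a", "cot": False, "category": "group_by", "accuracy": 0.80},
    {"model": "model-a", "cot": True, "category": "date_functions", "accuracy": 0.75},
    {"model": "model-a", "cot": True, "category": "group_by", "accuracy": 0.85},
    {"model": "model-b", "cot": False, "category": "date_functions", "accuracy": 0.60},
    {"model": "model-b", "cot": False, "category": "group_by", "accuracy": 0.65},
    {"model": "model-b", "cot": True, "category": "date_functions", "accuracy": 0.72},
    {"model": "model-b", "cot": True, "category": "group_by", "accuracy": 0.78},
]


def facet_results(rows):
    """Group rows as {cot_flag: {model: {category: accuracy}}}.

    Each cot_flag becomes one facet (subplot); each model becomes one
    overlaid series inside that facet.
    """
    facets = defaultdict(lambda: defaultdict(dict))
    for row in rows:
        facets[row["cot"]][row["model"]][row["category"]] = row["accuracy"]
    return {cot: dict(models) for cot, models in facets.items()}


def plot_facets(facets, path="results.png"):
    """Draw one subplot per CoT setting, with all models overlaid on it."""
    # Imported lazily so the grouping step above works without matplotlib.
    import matplotlib
    matplotlib.use("Agg")  # render to a file, no display needed
    import matplotlib.pyplot as plt

    cot_values = sorted(facets)
    fig, axes = plt.subplots(1, len(cot_values), sharey=True, squeeze=False)
    for ax, cot in zip(axes[0], cot_values):
        for model, scores in sorted(facets[cot].items()):
            categories = sorted(scores)
            # Plotting every model on the same axes is what produces the overlay.
            ax.plot(categories, [scores[c] for c in categories], marker="o", label=model)
        ax.set_title(f"cot={cot}")
        ax.set_xlabel("category")
        ax.legend()
    axes[0][0].set_ylabel("accuracy")
    fig.savefig(path)
    plt.close(fig)


facets = facet_results(RESULTS)
```

Calling `plot_facets(facets)` would write a two-panel image, one panel per CoT setting, with both hypothetical models drawn on each panel so they can be compared directly.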