S14.0 - LLM Testing - Githubissues

RamiAwar / dataline

Chat with your data - AI data analysis and visualization on CSV, Postgres, MySQL, Snowflake, SQLite...

https://dataline.app

GNU General Public License v3.0

129 stars 6 forks source link

Open anthony2261 opened 3 weeks ago

anthony2261 commented 3 weeks ago

We need to figure out a way to evaluate AI generated results so that we can compare quality if there are changes to the llm flow, prompts, etc..

Desperately needed.