TransformerOptimus / SuperAGI

<⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.
https://superagi.com/
MIT License
15.44k stars, 1.86k forks

A/B Test LLMs? #1182

Open krrishdholakia opened 1 year ago

krrishdholakia commented 1 year ago

Hi @Fluder-Paradyne @Tarraann @jedan2506

In production, we don't know if Llama2 is going to provide:

Would it be helpful to provide a way to easily A/B test between new models in production?

Context - I'm working on LiteLLM, and we recently released a way to A/B test straight from the completion endpoint:

Tutorial: https://docs.litellm.ai/docs/tutorials/ab_test_llms
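To make the idea concrete, here is a minimal sketch of weighted A/B routing between two models. It does not use LiteLLM's API (see the tutorial for that). The traffic split, the `gpt-3.5-turbo` model name, and the `pick_model`, `route`, and `call_model` names are all assumptions made up for illustration.

```python
import random

# Hypothetical traffic split: model name -> share of requests (assumed values).
SPLIT = {"gpt-3.5-turbo": 0.8, "llama-2-70b-chat": 0.2}


def pick_model(split, rng=random):
    """Pick one model name at random, weighted by its traffic share."""
    models = list(split)
    weights = [split[m] for m in models]
    return rng.choices(models, weights=weights, k=1)[0]


def route(prompt, split, call_model, rng=random):
    """Send the prompt to a randomly chosen model and tag the result.

    `call_model(model, prompt)` is a caller-supplied function (hypothetical
    here) that performs the actual completion request.
    """
    model = pick_model(split, rng)
    return {"model": model, "response": call_model(model, prompt)}
```

Tagging each response with the model that produced it is what lets you compare the two arms afterwards, for example on latency, cost, or user ratings.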

jedan2506 commented 1 year ago

Thank you for your valuable feedback. We will look into the information you provided and discuss it in detail.