evidentlyai / evidently

Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.
https://www.evidentlyai.com/evidently-oss
Apache License 2.0
5.44k stars, 602 forks

Can this be used for LLMs? #608

Open kf-rahman opened 1 year ago

elenasamuylova commented 1 year ago

Hi @kf-rahman

If you run LLM models in production, you can use Evidently to evaluate and monitor input text data quality, text data drift, and embedding drift.

We do not have any LLM-specific evaluation metrics yet, but depending on the task, you can also use Evidently to evaluate the model quality (e.g., use Classification Presets if you work on text classification tasks).

Which use case do you have in mind?

ketangangal commented 1 year ago

Hello @kf-rahman,

It's very handy for LLMs: all you need to do is generate embeddings with your LLM and use Evidently for monitoring. You can also refer to: https://docs.evidentlyai.com/user-guide/customization/embeddings-drift-parameters
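To make the idea concrete, here is a rough, library-free sketch of what checking embeddings for drift can look like: compare the mean embedding of a reference batch with the mean embedding of a current batch and flag drift when the cosine distance exceeds a threshold. This is not Evidently's API. The function names, the toy 2-dimensional vectors, and the 0.2 threshold are all assumptions made up for illustration; in practice the embeddings would come from your own model, and Evidently's embedding drift methods (see the linked docs) would do the comparison.

```python
import math
from statistics import fmean


def mean_vector(embeddings):
    """Column-wise mean of a list of equal-length embedding vectors."""
    return [fmean(column) for column in zip(*embeddings)]


def cosine_distance(a, b):
    """1 - cosine similarity between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def embedding_drift(reference, current, threshold=0.2):
    """Return (score, drifted) for two batches of embeddings.

    The threshold is an arbitrary illustrative value, not a recommendation.
    """
    score = cosine_distance(mean_vector(reference), mean_vector(current))
    return score, score > threshold


# Toy embeddings standing in for real model outputs (hypothetical data).
reference = [[1.0, 0.0], [0.9, 0.1], [1.0, 0.1]]
current_similar = [[0.95, 0.05], [1.0, 0.0]]
current_shifted = [[0.0, 1.0], [0.1, 0.9]]

similar_score, similar_drifted = embedding_drift(reference, current_similar)
shifted_score, shifted_drifted = embedding_drift(reference, current_shifted)
print(f"similar batch: score={similar_score:.4f}, drifted={similar_drifted}")
print(f"shifted batch: score={shifted_score:.4f}, drifted={shifted_drifted}")
```

Comparing mean vectors is only one of several possible approaches and can miss changes that leave the mean unchanged, so treat this as a way to build intuition rather than a production check.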

FYI @elenasamuylova