Integrate Evaluation Frameworks

munkim / monoid

Turn APIs into AI Agents

https://monoid.so

Apache License 2.0

47 stars, 16 forks

Integrate Evaluation Frameworks #17

Open edwisdom opened 9 months ago

edwisdom commented 9 months ago

Description

Several open-source frameworks (e.g. Phoenix, ToolBench, Langfuse) let you evaluate an Agent's performance on a set of tasks. Users may want to connect their Agent to one of these before deployment, so they can evaluate it more thoroughly than sandbox testing alone allows.
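To make the request more concrete, here is a minimal sketch of the kind of task-based evaluation such an integration would run. It is framework-agnostic and does not use the Phoenix, ToolBench, or Langfuse APIs. The names `EvalTask`, `EvalReport`, and `evaluate_agent` are hypothetical, and it assumes the Agent can be called as a plain function from prompt string to answer string, which may not match how monoid exposes Agents.

```python
from dataclasses import dataclass
from typing import Callable, Iterable


@dataclass
class EvalTask:
    """One evaluation case (hypothetical shape): a prompt and the expected answer."""
    prompt: str
    expected: str


@dataclass
class EvalReport:
    """Aggregate result of running an agent over a task set."""
    total: int
    passed: int

    @property
    def accuracy(self) -> float:
        # Avoid division by zero when the task set is empty.
        return self.passed / self.total if self.total else 0.0


def evaluate_agent(
    agent: Callable[[str], str],  # assumption: the Agent is callable as prompt -> answer
    tasks: Iterable[EvalTask],
) -> EvalReport:
    """Run the agent on every task and score it by case-insensitive exact match."""
    total = 0
    passed = 0
    for task in tasks:
        total += 1
        try:
            answer = agent(task.prompt)
        except Exception:
            # An agent crash counts as a failed task rather than aborting the run.
            continue
        if answer.strip().lower() == task.expected.strip().lower():
            passed += 1
    return EvalReport(total=total, passed=passed)
```

Exact-match scoring is only a placeholder. A real integration would hand the task set and the Agent's outputs to the chosen framework's own scorers instead.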