awslabs / agent-evaluation

A generative AI-powered framework for testing virtual agents.
https://awslabs.github.io/agent-evaluation/
Apache License 2.0
64 stars 10 forks

Add pass rate metric #32

Closed: tonykchen closed this issue 1 month ago

tonykchen commented 2 months ago

Add an accuracy metric as part of the evaluation output.

pass rate = number of tests passed / total number of tests
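
A minimal sketch of the formula above, for illustration only. The `EvalResult` record and the `pass_rate` function are hypothetical names, not the framework's actual types or API, and returning 0.0 for an empty test run is an assumption the issue does not specify.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass
class EvalResult:
    """Hypothetical per-test record; not the framework's real result type."""
    test_name: str
    passed: bool


def pass_rate(results: Iterable[EvalResult]) -> float:
    """Return the fraction of tests that passed, in the range [0.0, 1.0]."""
    results = list(results)
    if not results:
        # Assumption: report 0.0 rather than dividing by zero on an empty run.
        return 0.0
    num_passed = sum(1 for r in results if r.passed)
    return num_passed / len(results)


if __name__ == "__main__":
    # Made-up sample data: three of four tests pass.
    sample = [
        EvalResult("greeting", True),
        EvalResult("refund_flow", False),
        EvalResult("order_status", True),
        EvalResult("handoff", True),
    ]
    print(f"pass rate: {pass_rate(sample):.0%}")  # prints "pass rate: 75%"
```

Keeping the metric as a plain fraction and formatting it only at output time lets the same value be written to a summary file or compared against a threshold.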

dferguson992 commented 1 month ago

I would like to help with this!

tonykchen commented 1 month ago

See #60