aws / fmeval

Foundation Model Evaluations Library
http://aws.github.io/fmeval
Apache License 2.0

Option to disable BERTScore in QAAccuracy #330

Open athewsey opened 1 month ago

athewsey commented 1 month ago

Adding BERTScore to QAAccuracy in version 1.2.0 has massively slowed down evaluation. After upgrading, I've seen interactive evaluations (with our custom Streamlit solution here) go from ~30 seconds to 3 minutes, and some jobs created through the AWS Console go from less than one hour to ~14 hours or more.

Since this scoring method appears to be far more compute-intensive than the others, could we have a configuration option in QAAccuracyConfig to disable it?
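
To illustrate the kind of option I mean, here is a minimal, self-contained sketch. It does not use fmeval's actual classes: `SketchQAAccuracyConfig`, the `include_bert_score` flag, `score_answer`, and the `bert_scorer` callable are all hypothetical names chosen for illustration, and the real change might look quite different.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Optional


@dataclass(frozen=True)
class SketchQAAccuracyConfig:
    """Hypothetical stand-in for QAAccuracyConfig; not fmeval's real class."""

    # Assumed flag name: when False, the expensive model-based score is skipped.
    include_bert_score: bool = True


def exact_match(target: str, output: str) -> float:
    """Cheap string-level score: 1.0 if the normalized strings are identical."""
    return float(target.strip().lower() == output.strip().lower())


def score_answer(
    target: str,
    output: str,
    config: SketchQAAccuracyConfig,
    bert_scorer: Optional[Callable[[str, str], float]] = None,
) -> Dict[str, float]:
    """Compute the cheap score always, and the expensive one only if enabled."""
    scores = {"exact_match": exact_match(target, output)}
    # The expensive scorer is never called when the flag is off.
    if config.include_bert_score and bert_scorer is not None:
        scores["bertscore"] = bert_scorer(target, output)
    return scores
```

With a flag like this defaulting to `True`, existing users would keep the current behaviour, while interactive or cost-sensitive runs could pass `SketchQAAccuracyConfig(include_bert_score=False)` and get only the cheap string-based scores.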