Adapts the evaluation functions so they run with promptfoo
Still to do
Potentially use the actual functions from src/evaluation.py (or decide to use these as the default). Should be simple, but might require some tweaking of PYTHONPATH in the promptfoo call.
The expected format needs updating to reflect the current state of the prompt
Fixes #81 .
Adapts the evaluation functions so they run with promptfoo
Still to do
src/evaluation.py
(or decide to use these as the default). Should be simple, but might require some tweaking of PYTHONPATH in the promptfoo call.Pull request checklist
main