truera / trulens

Evaluation and Tracking for LLM Experiments
https://www.trulens.org/
MIT License
1.84k stars 163 forks source link

Enable regex to extract floats in score generation #1223

Closed sfc-gh-dhuang closed 2 weeks ago

sfc-gh-dhuang commented 2 weeks ago

Items to add to release announcement:

Failing example when testing groundedness feedback functions - where the score accompanying COT was interpreted as 0, instead of the expected 10:

0.0,
 {'reasons': 'STATEMENT 0:\nCriteria: I am a man and I love fish,\nSupporting Evidence: The source states "All men love fish", and the statement contains "I am a man and I love fish". The source contains the information that a man loves fish, and the statement contains the information that the speaker is a man and loves fish.\nScore: 10.0\n'}

I'm switching to PATTERN_NUMBER to unblock for now.

Other details that are good to know but need not be announced: