OpenBioLink / ThoughtSource

A central, open resource for data and tools related to chain-of-thought reasoning in large language models. Developed @ Samwald research group: https://samwald.info/
MIT License
867 stars 69 forks source link

improved evaluation function, simple checks #88

Closed KonstantinHebenstreit closed 1 year ago

KonstantinHebenstreit commented 1 year ago

Adjusted the evaluation function for outputs of the nightly model of Cohere. General improvements of evaluation function. Now doing more simple checks before going into regex comparisons.