fix: py-script execute's return type should be consistent with scorer…

empirical-run / empirical

Test and evaluate LLMs and model configurations, across all the scenarios that matter for your application

https://docs.empirical.run

MIT License

146 stars 11 forks source link

Closed KaustubhKumar05 closed 4 months ago

changeset-bot[bot] commented 4 months ago

Latest commit: 87103bb3509bc6f88c87f84b7dccc929df08b11f

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 2 packages

| Name | Type | | ------------------ | ----- | | @empiricalrun/core | Minor | | @empiricalrun/cli | Patch |

github-actions[bot] commented 4 months ago

Stats	Run #8bb0: gpt-3.5-turbo	Run #30ed: gpt-4-turbo-preview
outputs	100%	100%
is-json	100%	100%

Total dataset samples: 2