feat: openai assistants

empirical-run / empirical

Test and evaluate LLMs and model configurations, across all the scenarios that matter for your application

MIT License

148 stars 13 forks source link

🦋 Changeset detected

Latest commit: 3f8c815cfe98cbe3661c9f57f8b0d7d080ca6991

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 6 packages

| Name | Type | | -------------------- | ----- | | @empiricalrun/scorer | Minor | | @empiricalrun/types | Minor | | @empiricalrun/core | Minor | | @empiricalrun/cli | Minor | | @empiricalrun/ai | Minor | | web | Minor |

Not sure what this means? Click here to learn what changesets are.

Click here if you're a maintainer who wants to add another changeset to this PR

	Run #05ef: gpt-3.5-turbo	Run #4b06: gpt-4-turbo-preview
Outputs	100%	100%
Avg latency	844ms	1508ms

Run #05ef: gpt-3.5-turbo

Run #4b06: gpt-4-turbo-preview

Outputs

100%

Avg latency

844ms

1508ms

empirical-run / empirical

feat: openai assistants #174

🦋 Changeset detected

Empirical Run Summary