empirical-run / empirical

Test and evaluate LLMs and model configurations, across all the scenarios that matter for your application

https://docs.empirical.run

MIT License

141 stars, 10 forks

issues

chore: better error message for scorer name changes

#185 arjunattam closed 2 months ago
1
feat: example with cohere and llamacloud

#184 arjunattam opened 2 months ago
2
fix: py-script getting timed out

#183 saikatmitra91 closed 2 months ago
1
feat: add telemetry for usage analytics

#182 arjunattam closed 2 months ago
1
chore: change scorer names to json-syntax and llm-critic

#181 arjunattam closed 2 months ago
2
chore: update default init config for global scorers

#180 arjunattam closed 2 months ago
1
chore: update package versions for release

#179 github-actions[bot] closed 2 months ago
0
feat: ability to add global scorers

#178 saikatmitra91 closed 2 months ago
2
fix: typo in chat prompt type

#177 saikatmitra91 closed 2 months ago
1
feat: add support for threaded prompt

#176 saikatmitra91 closed 2 months ago
2
chore: update package versions for release

#175 github-actions[bot] closed 2 months ago
0
feat: openai assistants

#174 arjunattam closed 2 months ago
2
fix: timeout error tests

#173 saikatmitra91 closed 2 months ago
1
chore: minor improvements to azure openai docs

#172 arjunattam closed 2 months ago
2
chore: update package versions for release

#171 saikatmitra91 closed 2 months ago
0
feat: add azure openai provider

#170 saikatmitra91 closed 2 months ago
2
chore: update package versions for release

#169 github-actions[bot] closed 2 months ago
0
rag example: add run -> change parameter `model` to `gpt-4` -> script times out but no error shown on the UI

#168 saikatmitra91 closed 1 month ago
2
[chatbot example] - CLI summary doesn't show the second score - `personal`

#167 saikatmitra91 closed 2 months ago
3
Model latency numbers include time taken to retry

#166 saikatmitra91 closed 1 month ago
2
feat: add spider example with 150 samples

#165 arjunattam closed 1 month ago
3
chore: update package versions for release

#164 github-actions[bot] closed 2 months ago
0
chore: update $schema version for spider example

#163 arjunattam closed 2 months ago
2
feat: add ability to share the results

#162 saikatmitra91 closed 2 months ago
1
chore: update package versions for release

#161 github-actions[bot] closed 2 months ago
0
fix: throw error from anthropic response instead of returning

#160 KaustubhKumar05 closed 2 months ago
1
chore(deps): bump aiohttp from 3.9.3 to 3.9.4 in /examples/rag

#159 dependabot[bot] closed 2 months ago
1
chore: update package versions for release

#158 github-actions[bot] closed 2 months ago
0
fix: ui breaking in case of too many sample inputs

#157 saikatmitra91 closed 2 months ago
2
feat: simplify config to accept provider, scorers as top level keys

#156 KaustubhKumar05 closed 2 months ago
2
chore: update package versions for release

#155 github-actions[bot] closed 2 months ago
0
feat: add runtime configuration support in execution

#154 saikatmitra91 closed 2 months ago
2
feat: add cachedir to gitignore on init

#153 arjunattam closed 2 months ago
1
fix: add json schema URL to the default empiricalrc.json created

#152 saikatmitra91 closed 2 months ago
1
chore: update package versions for release

#151 github-actions[bot] closed 2 months ago
0
fix: cli package downloading unknown fetch package

#150 saikatmitra91 closed 2 months ago
1
chore: update schema upload action to purge cache for latest url

#149 KaustubhKumar05 closed 2 months ago
2
feat: show latency and tokens summary for runs

#148 arjunattam closed 2 months ago
1
chore: update package versions for release

#147 github-actions[bot] closed 2 months ago
0
fix: run metadata storage throwing error for second run

#146 saikatmitra91 closed 2 months ago
1
fix: add fetch library which supports retry and timeout

#145 saikatmitra91 closed 2 months ago
1
fix: recursively replace placeholders and allow whitespaces in them

#144 arjunattam closed 2 months ago
1
fix: run score summary

#143 saikatmitra91 closed 2 months ago
1
chore: nicer message string for sql-syntax scorer

#142 arjunattam closed 2 months ago
2
chore: add log line to spider prepare.py

#141 arjunattam closed 2 months ago
2
feat: add execution accuracy in spider example

#140 arjunattam closed 2 months ago
2
feat: support for openai models on azure

#139 arjunattam closed 2 months ago
1
fix: remove log from csv dataset loader

#138 saikatmitra91 closed 2 months ago
1
feat: add support for parameters in gemini models

#137 KaustubhKumar05 closed 2 months ago
2
fix: sample inputs tab ui

#136 saikatmitra91 closed 2 months ago
1
