issues
search
empirical-run
/
empirical
Test and evaluate LLMs and model configurations, across all the scenarios that matter for your application
https://docs.empirical.run
MIT License
141
stars
10
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
chore: better error message for scorer name changes
#185
arjunattam
closed
2 months ago
1
feat: example with cohere and llamacloud
#184
arjunattam
opened
2 months ago
2
fix: py-script getting timed out
#183
saikatmitra91
closed
2 months ago
1
feat: add telemetry for usage analytics
#182
arjunattam
closed
2 months ago
1
chore: change scorer names to json-syntax and llm-critic
#181
arjunattam
closed
2 months ago
2
chore: update default init config for global scorers
#180
arjunattam
closed
2 months ago
1
chore: update package versions for release
#179
github-actions[bot]
closed
2 months ago
0
feat: ability to add global scorers
#178
saikatmitra91
closed
2 months ago
2
fix: typo in chat prompt type
#177
saikatmitra91
closed
2 months ago
1
feat: add support for threaded prompt
#176
saikatmitra91
closed
2 months ago
2
chore: update package versions for release
#175
github-actions[bot]
closed
2 months ago
0
feat: openai assistants
#174
arjunattam
closed
2 months ago
2
fix: timeout error tests
#173
saikatmitra91
closed
2 months ago
1
chore: minor improvements to azure openai docs
#172
arjunattam
closed
2 months ago
2
chore: update package versions for release
#171
saikatmitra91
closed
2 months ago
0
feat: add azure openai provider
#170
saikatmitra91
closed
2 months ago
2
chore: update package versions for release
#169
github-actions[bot]
closed
2 months ago
0
rag example: add run -> change parameter `model` to `gpt-4` -> script times out but no error shown on the UI
#168
saikatmitra91
closed
1 month ago
2
[chatbot example] - CLI summary doesnt show the second score - `personal`
#167
saikatmitra91
closed
2 months ago
3
Model latency numbers include time taken to retry
#166
saikatmitra91
closed
1 month ago
2
feat: add spider example with 150 samples
#165
arjunattam
closed
1 month ago
3
chore: update package versions for release
#164
github-actions[bot]
closed
2 months ago
0
chore: update $schema version for spider example
#163
arjunattam
closed
2 months ago
2
feat: add ability to share the results
#162
saikatmitra91
closed
2 months ago
1
chore: update package versions for release
#161
github-actions[bot]
closed
2 months ago
0
fix: throw error from anthropic response instead of returning
#160
KaustubhKumar05
closed
2 months ago
1
chore(deps): bump aiohttp from 3.9.3 to 3.9.4 in /examples/rag
#159
dependabot[bot]
closed
2 months ago
1
chore: update package versions for release
#158
github-actions[bot]
closed
2 months ago
0
fix: ui breaking in case of too many sample inputs
#157
saikatmitra91
closed
2 months ago
2
feat: simplify config to accept provider, scorers as top level keys
#156
KaustubhKumar05
closed
2 months ago
2
chore: update package versions for release
#155
github-actions[bot]
closed
2 months ago
0
feat: add runtime configuration support in execution
#154
saikatmitra91
closed
2 months ago
2
feat: add cachedir to gitignore on init
#153
arjunattam
closed
2 months ago
1
fix: add json schema URL to the default empiricalrc.json created
#152
saikatmitra91
closed
2 months ago
1
chore: update package versions for release
#151
github-actions[bot]
closed
2 months ago
0
fix: cli package downloading unknown fetch package
#150
saikatmitra91
closed
2 months ago
1
chore: update schema upload action to purge cache for latest url
#149
KaustubhKumar05
closed
2 months ago
2
feat: show latency and tokens summary for runs
#148
arjunattam
closed
2 months ago
1
chore: update package versions for release
#147
github-actions[bot]
closed
2 months ago
0
fix: run metadata storage throwing error for second run
#146
saikatmitra91
closed
2 months ago
1
fix: add fetch library which supports retry and timeout
#145
saikatmitra91
closed
2 months ago
1
fix: recursively replace placeholders and allow whitespaces in them
#144
arjunattam
closed
2 months ago
1
fix: run score summary
#143
saikatmitra91
closed
2 months ago
1
chore: nicer message string for sql-syntax scorer
#142
arjunattam
closed
2 months ago
2
chore: add log line to spider prepare.py
#141
arjunattam
closed
2 months ago
2
feat: add execution accuracy in spider example
#140
arjunattam
closed
2 months ago
2
feat: support for openai models on azure
#139
arjunattam
closed
2 months ago
1
fix: remove log from csv dataset loader
#138
saikatmitra91
closed
2 months ago
1
feat: add support for parameters in gemini models
#137
KaustubhKumar05
closed
2 months ago
2
fix: sample inputs tab ui
#136
saikatmitra91
closed
2 months ago
1
Previous
Next