issues
search
empirical-run
/
empirical
Test and evaluate LLMs and model configurations, across all the scenarios that matter for your application
https://docs.empirical.run
MIT License
146
stars
11
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
fix: add support for alternate port for web UI
#89
saikatmitra91
closed
5 months ago
1
test: make mistral test less flaky
#88
arjunattam
closed
5 months ago
1
chore: add repo link to package.json files
#87
arjunattam
closed
5 months ago
1
feat: add parameters in run config ui
#86
saikatmitra91
closed
5 months ago
1
feat: add interactivity to the UI
#85
saikatmitra91
closed
5 months ago
1
chore: improve error message copy
#84
arjunattam
closed
5 months ago
1
chore(deps): bump pillow from 10.2.0 to 10.3.0 in /examples/rag
#83
dependabot[bot]
closed
5 months ago
1
chore(deps-dev): bump vite from 5.1.6 to 5.2.8
#82
dependabot[bot]
closed
5 months ago
1
chore(deps): bump langchain-core from 0.1.33 to 0.1.35 in /examples/rag
#81
dependabot[bot]
closed
5 months ago
1
chore(deps): bump express from 4.18.3 to 4.19.2
#80
dependabot[bot]
closed
5 months ago
0
chore: fix docs for script based runs
#79
saikatmitra91
closed
5 months ago
1
fix: humaneval tests broke
#78
arjunattam
closed
5 months ago
1
fix: throw openai error response
#77
KaustubhKumar05
closed
5 months ago
1
chore: clean up humaneval example
#76
arjunattam
closed
5 months ago
1
fix: top bar is not sticky
#75
KaustubhKumar05
closed
5 months ago
1
feat: add support for passthrough script executor config
#74
saikatmitra91
closed
5 months ago
1
fix: rename py script executor value property to path
#73
saikatmitra91
closed
5 months ago
1
fix: openai gets stuck when balance is negative
#72
arjunattam
closed
5 months ago
1
docs: add links to ci/cd guide
#71
arjunattam
closed
5 months ago
1
feat: support model parameters
#70
arjunattam
closed
5 months ago
2
feat: add fireworks model provider
#69
arjunattam
closed
5 months ago
2
chore: update package versions for release
#68
github-actions[bot]
closed
5 months ago
0
fix: python scorer should prefer config.name
#67
arjunattam
closed
5 months ago
1
chore: update package versions for release
#66
github-actions[bot]
closed
5 months ago
0
fix: reduce bundle size of cli package
#65
saikatmitra91
closed
5 months ago
1
chore: update package versions for release
#64
github-actions[bot]
closed
5 months ago
0
chore: update npm package licenses
#63
saikatmitra91
closed
5 months ago
1
chore: update license
#62
saikatmitra91
closed
5 months ago
1
fix: monaco blocking page scroll
#61
KaustubhKumar05
closed
5 months ago
1
chore: stop changesets from publishing release by default
#60
saikatmitra91
closed
5 months ago
1
chore: update package versions for release
#59
github-actions[bot]
closed
5 months ago
0
feat: add run summary to run header
#58
saikatmitra91
closed
5 months ago
1
feat: add llm-faithfulness scorer
#57
arjunattam
opened
5 months ago
1
feat: support cli runs on github actions
#56
arjunattam
closed
5 months ago
2
chore: update package versions for release
#55
github-actions[bot]
closed
5 months ago
0
chore: ignore changeset file changes for gh build step
#54
saikatmitra91
closed
5 months ago
1
chore: add npmignore file in package generators
#53
saikatmitra91
closed
5 months ago
1
chore: update package versions for release
#52
github-actions[bot]
closed
5 months ago
0
chore: fix npm publish script name
#51
saikatmitra91
closed
5 months ago
1
chore: add npm publish script
#50
saikatmitra91
closed
5 months ago
1
docs: fix og image
#49
arjunattam
closed
5 months ago
1
feat: pick env vars from dotenv files
#48
arjunattam
closed
5 months ago
1
feat: default name for run if not specified
#47
arjunattam
closed
5 months ago
1
fix: update prompt in basic example and default init config
#46
arjunattam
closed
5 months ago
1
docs: update docs, examples, schema for scorer interface
#45
arjunattam
closed
5 months ago
1
chore: interrupt watch routine when new change is detected
#44
arjunattam
closed
5 months ago
1
fix: support aliases for claude 3 models
#43
arjunattam
closed
5 months ago
1
feat: runs store for a dataset
#42
arjunattam
closed
4 months ago
1
fix: sample card broke after input schema change
#41
arjunattam
closed
5 months ago
1
chore: update pull request events for pr message lint gh action
#40
saikatmitra91
closed
5 months ago
1
Previous
Next