awslabs/agent-evaluation
A generative AI-powered framework for testing virtual agents.
https://awslabs.github.io/agent-evaluation/
Apache License 2.0
64 stars
10 forks
issues
Implement store
#74
tonykchen
opened
1 week ago
0
chore(deps): bump urllib3 from 2.2.1 to 2.2.2 in /demo
#73
dependabot[bot]
closed
1 week ago
0
chore(deps): bump tornado from 6.4 to 6.4.1 in /demo
#72
dependabot[bot]
closed
3 weeks ago
0
CICD example
#71
sharonxiaohanli
closed
1 week ago
0
1 validation error detected: Value {agentalias} failed to satisfy constraint
#70
ayenda
closed
1 month ago
1
Update CONTRIBUTING.md
#69
tonykchen
closed
1 month ago
0
Closes #15
#68
dferguson992
closed
3 weeks ago
0
bug: red traffic light dots for tests that passed
#67
jotelfor
opened
1 month ago
0
Bugfix for throttlingException when calling the InvokeAgent operation
#66
johntelforduk
opened
1 month ago
1
In concurrent.futures.ThreadPoolExecutor set max_workers correctly.
#65
johntelforduk
closed
1 month ago
0
throttlingException when calling the InvokeAgent operation
#64
tonykchen
opened
1 month ago
0
ValidationException when calling the InvokeAgent operation
#63
tonykchen
opened
1 month ago
0
Refactor and improve coverage
#62
tonykchen
closed
1 month ago
0
chore(deps): bump requests from 2.31.0 to 2.32.0 in /demo
#61
dependabot[bot]
closed
3 weeks ago
0
Pass Rate Metric
#60
dferguson992
closed
1 month ago
0
add a demo ui
#59
sharonxiaohanli
closed
1 month ago
0
Bump to 0.2.0
#58
tonykchen
closed
1 month ago
0
Fix custom target import
#57
tonykchen
closed
1 month ago
0
Enable support for Q Business applications w/ IAM Identity Center
#56
tonykchen
closed
1 month ago
0
Add CI/CD example
#55
tonykchen
opened
1 month ago
0
Save run artifacts to S3
#54
tonykchen
opened
1 month ago
1
Doc updates
#53
tonykchen
closed
1 month ago
0
Unable to import TargetResponse from 'agenteval.targets'
#52
tengfone
closed
1 month ago
1
chore: set package version using importlib_metadata
#51
tonykchen
closed
1 month ago
0
Import package version using importlib_metadata
#50
tonykchen
closed
1 month ago
0
Update changelog with `0.1.0`
#49
tonykchen
closed
1 month ago
0
Add site analytics
#48
tonykchen
closed
1 month ago
0
Update docs
#47
tonykchen
closed
1 month ago
0
Add dispatch event for docs and pypi workflows
#46
tonykchen
closed
2 months ago
0
Add provisioned throughput support for targets on Amazon Bedrock
#45
tonykchen
opened
2 months ago
0
Major project refactor
#44
tonykchen
closed
2 months ago
0
Set a safe default max num of threads
#43
sharonxiaohanli
closed
2 months ago
0
Revamp user documentation
#42
tonykchen
closed
2 months ago
0
Add a demo UI
#41
sharonxiaohanli
closed
2 months ago
0
Add release workflows
#40
tonykchen
closed
2 months ago
0
Add release workflows
#39
tonykchen
closed
2 months ago
0
add bedrock knowledgebase as target
#38
sharonxiaohanli
closed
2 months ago
0
Implement release pipelines
#37
tonykchen
closed
2 months ago
0
Save run outputs to an artifacts directory
#36
tonykchen
closed
2 months ago
0
Implement backoff strategy
#35
tonykchen
closed
2 months ago
0
Use `.jinja` extension
#34
tonykchen
closed
2 months ago
0
Remove autoescaping for templates
#33
tonykchen
closed
2 months ago
0
Add pass rate metric
#32
tonykchen
closed
1 month ago
2
Global test cases
#31
tonykchen
opened
2 months ago
0
Add exceptions to trace
#30
tonykchen
opened
2 months ago
0
Fix security vulnerabilities
#29
tonykchen
closed
2 months ago
0
Fix inconsistent evaluation results
#28
tonykchen
closed
2 months ago
1
add work-dir as a configurable param
#27
sharonxiaohanli
closed
2 months ago
1
add work-dir as cli parameter
#26
sharonxiaohanli
closed
2 months ago
0
Pre-release tasks
#25
tonykchen
closed
2 months ago
0