UKGovernmentBEIS/inspect_ai
Inspect: A framework for large language model evaluations
https://inspect.ai-safety-institute.org.uk/
MIT License
567 stars
98 forks
Adding the official GAIA scorer && Changing default arguments on gaia_dataset
#619
Closed
max-kaufmann closed 2 days ago
max-kaufmann commented 2 days ago
This PR contains:
[ ] New features
[ ] Changes to dev-tools e.g. CI config / github tooling
[ ] Docs
[ ] Bug fixes
[ ] Code refactor
What is the current behavior? (You can also link to an open issue here)
What is the new behavior?
Does this PR introduce a breaking change? (What changes might users need to make in their application due to this PR?)
Other information: