evals Search Results - Githubissues

1000+ results
for evals

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

UKGovernmentBEIS/inspect_ai #672

.json file is missing all messages when an error is thrown

If an error is thrown during execution, the entire trace is lost, include debug messages. Instead, all messages up to the error, as well as all debug messages, should be included in the log. (Ideall…

JasonGross updated 3 weeks ago
1
sweepai/evals #6

Sweep: refactor the training loop from a script to a class i…

### Details _No response_ Checklist - [X] Modify `src/main.py` ✓ https://github.com/sweepai/evals/commit/35f37cdbd49a63bfdd7a39e2d65b0c186ad83c49 - [X] Ran sandbox for `src/main.py`. ✓ https://gi…

wwzeng1 updated 11 months ago
1
gfx-rs/wgpu #6302

WGSL: Const. eval. short-circuiting

**Description** Naga currently rejects programs with constant expressions such as this one: ```wgsl const asdf = false && (123 / 0 > 0); ``` An uninformed reader might assume that divid…

ErichDonGubler updated 4 weeks ago
2
facebookresearch/MobileLLM #2

eval issues?

i think the readme.md has some issues regarding to the evals i just notice it with piqa, the numbers are too low compared to the actual paper

appvoid updated 3 months ago
2
abacaj/code-eval #11

Any plans on running evals for codellama?

I'm keeping https://github.com/ErikBjare/are-copilots-local-yet up-to-date, and would love to see some codellama numbers given it's now SOTA :)

ErikBjare updated 10 months ago
2
robertfeldt/BlackBoxOptim.jl #119

Parallel example fails on julia 1.1

This is what I get with `master`: ``` ~\.julia\dev\BlackBoxOptim\examples [master ≡]> julia .\rosenbrock_parallel.jl Starting optimization with optimizer XNESOpt{Float64,RandomBound{ContinuousRectS…

davidanthoff updated 5 years ago
6
openai/evals #1469

Support for Azure OpenAI client

### Describe the feature or improvement you're requesting Currently evals framework does not support Azure openAI implementation. This is blocker if someone wants to use eval with Azure OpenAI implem…

pkt1583 updated 7 months ago
2
LilithHafner/Chairmarks.jl #102

Detect cases where first eval is slower than subsequent eval…

If I have something like `@b rand(1000) sort!`, the first eval is much slower than subsequent evals within a given sample, which violates benchmarking assumptions and results in weird results. For exa…

LilithHafner updated 5 months ago
1
archignes/searchevals #1

implement an input helper to validate new evals

danielsgriffin updated 8 months ago
2
sweepai/evals #11

Sweep: add comments and docstrings to main.py and api.py

### Details _No response_ Checklist - [X] Modify `src/main.py` ✓ https://github.com/sweepai/evals/commit/79c50514c76cc63da87009fa58909bf838a262c9 - [X] Ran sandbox for `src/main.py`. ✗ - [X] Modi…

wwzeng1 updated 11 months ago
1

上一页 1...14 15 16 17 18 19 20...100 下一页

1000+ results for evals

1000+ results
for evals