evals Search Results - Githubissues

1000+ results
for evals

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

microsoft/promptflow #3382

[BUG] Function `evaluate(...)` from promptflow.evals.evaluat…

**Describe the bug** I followed the example in MSDocs [Evaluate on test dataset using `evaluate()`](https://learn.microsoft.com/en-us/azure/ai-studio/how-to/develop/flow-evaluate-sdk#evaluate-on-test…

megel updated 5 days ago
4
openai/evals #138

Music evals

It could be interesting to explore if we could use [MusPy](https://salu133445.github.io/muspy/) to add some text/symbolic music evals. /cc @salu133445

bhack updated 1 year ago
1
zeno-ml/zeno-evals #1

Error running Evals after installation - pydantic library

I used OpenAI Evals to run an eval and get a jsonl report. I then pip installed zeno-evals to view the report. But when I went back to run another eval I got the following error: ``` ImportError: …

shyam-krish updated 3 months ago
2
Arize-ai/phoenix #2149

[ENHANCEMENT] Checkpoint (persist) evals results

**Is your feature request related to a problem? Please describe.** When running evals across a large dataframe, it'd be useful to temporarily checkpoint/persist the results in case something happens …

trevor-laviale-arize updated 3 months ago
3
kirillbobyrev/pabi #227

Try Mixture of Experts for Eval

Think about using nets for different stages of the game (e.g. Stockfish has some conditionals inside the net that kind of act like "extensions" of the net based on piece count if I understand correctl…

kirillbobyrev updated 4 weeks ago
2
indus/VLEX #2

Compile Evals

V1 http://jsbin.com/duvayova/1/edit V2 http://jsbin.com/ditogatu/5/edit I had no net for a couple of days (ooooh the humanity!) so played with your code for something to do. Originally I was only int…

PAEz updated 10 years ago
15
LilithHafner/Chairmarks.jl #102

Detect cases where first eval is slower than subsequent eval…

If I have something like `@b rand(1000) sort!`, the first eval is much slower than subsequent evals within a given sample, which violates benchmarking assumptions and results in weird results. For exa…

LilithHafner updated 2 months ago
1
joeoneil/conditional-backend #9

API: /evals/batch

Batch view / definition routes **7 Routes** --- **POST** /evals/batch _accessible to all upperclassmen_ Creates a new batch with either specified criteria or specific members If created by the eval…

joeoneil updated 9 months ago
1
openai/evals #637

Idea for Evals: Emotion and sentiment analysis Evals

Understanding emotions and sentiments is an essential aspect of human communication. The system's ability to recognize these emotions enables more appropriate, context-aware, and empathetic responses.…

Pabreetzio updated 1 year ago
6
edfan/firehose #28

class evals improvements

at minimum: - separate per term (combining fall/spring + iap/summer hours is wrong, see language classes) will push a temporary fix, but a more correct fix would involve editing the compiler (wh…

cjquines updated 2 years ago
1

上一页 1...2 3 4 5 6 7 8...100 下一页

1000+ results for evals

1000+ results
for evals