-
**Describe the bug**
I followed the example in MSDocs [Evaluate on test dataset using `evaluate()`](https://learn.microsoft.com/en-us/azure/ai-studio/how-to/develop/flow-evaluate-sdk#evaluate-on-test…
-
It could be interesting to explore if we could use [MusPy](https://salu133445.github.io/muspy/) to add some text/symbolic music evals.
/cc @salu133445
-
I used OpenAI Evals to run an eval and get a jsonl report. I then pip installed zeno-evals to view the report. But when I went back to run another eval I got the following error:
```
ImportError: …
-
**Is your feature request related to a problem? Please describe.**
When running evals across a large dataframe, it'd be useful to temporarily checkpoint/persist the results in case something happens …
-
Think about using nets for different stages of the game (e.g. Stockfish has some conditionals inside the net that kind of act like "extensions" of the net based on piece count if I understand correctl…
-
V1
http://jsbin.com/duvayova/1/edit
V2
http://jsbin.com/ditogatu/5/edit
I had no net for a couple of days (ooooh the humanity!) so played with your code for something to do.
Originally I was only int…
PAEz updated
10 years ago
-
If I have something like `@b rand(1000) sort!`, the first eval is much slower than subsequent evals within a given sample, which violates benchmarking assumptions and results in weird results. For exa…
-
Batch view / definition routes
**7 Routes**
---
**POST** /evals/batch
_accessible to all upperclassmen_
Creates a new batch with either specified criteria or specific members
If created by the eval…
-
Understanding emotions and sentiments is an essential aspect of human communication. The system's ability to recognize these emotions enables more appropriate, context-aware, and empathetic responses.…
-
at minimum:
- separate per term (combining fall/spring + iap/summer hours is wrong, see language classes)
will push a temporary fix, but a more correct fix would involve editing the compiler (wh…