-
## Description
Evaluation documents consider what the activity achieved, whether the intended objectives were met, what the major factors influencing the achievement or non-achievement of the objec…
-
The above typo exists in this file:
https://github.com/anthropics/courses/blob/master/prompt_evaluations/08_prompt_foo_model_graded/lesson.ipynb
-
![evals_al2](https://cloud.githubusercontent.com/assets/6147456/24779990/f89fdd3e-1b01-11e7-983f-2412a7c17e7b.png)
-
could be good to have some fairness related datasets, e.g., from https://arxiv.org/abs/2108.02818. curious how LAION CLIP compares to OAI CLIP.
-
On the view page of a hypercert the checkmarks for the evaluations are too far away from the text.
Suggested solution:
- Have up to two evaluations side by side (responsive design)
- Have a border a…
-
### Describe the feature you'd like to request
Should be possible to contest an existing evaluation
### Describe the solution you'd like
Should look like an evaluation that just points to anoth…
-
Hi
can someone help me understanding the different evaluations?
There is one "Car@0.70, 0.70, 0.70" and one "Car@0.70, 0.50, 0.50"
ans also a "Car coco..."
And which of the contents (bbox, bev, …
-
We would like to provide evaluation data as part of our API.
To this effect, we need to:
- [ ] Extend our database schema to include Evalution data
- [ ] Create a Golang model conforming to the schem…
-
Hello, I'm trying to run MTEB on a cluster without internet access, but I am struggling. Here are the following instructions I've followed:
1. `$ pip install mteb`
2. `$ !pip install --upgrade git…
-
hey,
first of all thanks for your work, this project seems promising to me, since so far A/B testing in SPAs often turns out to be painful.
anyhow, i do not see any kind of evaluation tools comin…