-
- [ ] Move the [WDL](https://dockstore.org/my-workflows/github.com/rlorigro/sv_merge/hapestry_merge_scattered) to appropriate location?
- [ ] Check compatibility with existing pipeline (what are req…
-
I was wondering how the trained models are intended to be evaluated. I don't believe that the paper states how many samples were used to compute the metrics. The code appears to give some indication b…
-
[ ] I have checked the [documentation](https://docs.ragas.io/) and related resources and couldn't resolve my bug.
**Describe the bug**
I cannot evaluate open source gemma:2b model using ragas. I…
-
Hi @MXueguang ,
I am currently evaluating a sparse retriever, splade++ (cocondensor-ensembledistil) on arguana using below command:
```
./eval_beir.sh --dataset arguana
--tokenize…
-
Sentry Issue: [METAMASK-MOBILE-2WQH](https://metamask.sentry.io/issues/5973266451/?referrer=github_integration)
```
TypeError: undefined is not an object (evaluating 'this._queue.shift')
at this.lo…
-
https://osf.io/ry23m/
-
The `FloatExprEvaluator` should ignore `_Atomic` types, and only care about the wrapped type.
For example, treat `_Atomic(float)` as a simple `float`.
Clang with assertions would fail on this vali…
-
This is a general effort for profiling, and then speeding up underlying code by:
1. Evaluating if there are inefficiencies in the implementation
2. Evaluating if there's repeated or unneeded step…
-
Hi, I tried to eval the Llama-3-Instruct-8B-SimPO-v0.2 checkpoint by arena-hard-auto, and I only got
Llama-3-Instruct-8B-SimPO-v0.2 | score: 35.4 | 95% CI: (-3.2, 2.0) | average #tokens: 530
…
-
### Prerequisite
- [X] I have searched [Issues](https://github.com/open-mmlab/mmdetection3d/issues) and [Discussions](https://github.com/open-mmlab/mmdetection3d/discussions) but cannot get the exp…