-
Hello,
I have currently tweaked the prompt optimization tutorial so that I can see if I can improve it's ability to improve on medical multiple -choice datasets. However, the results are getting p…
-
Does the 'exp_zx/msrvtt/run.sh' support zero shot demos for my own custom data or do I have to write my own testing scripts? I am currently looking at 'VideoMamba/videomamba/video_mm/exp_zs/msrvtt/m16…
-
Hi, I downloaded the GeneOntology dataset from the provided Zenodo link, but I came across this error during model evaluation:
```
PytorchStreamReader failed reading zip archive: invalid header or…
-
# Task Name
Melodic pattern reproduction performance grading
## Task Objective
to help the model learn how to grade human melodic pattern reproduction, and help the learners to grade their pe…
-
Hello, I don't know what I'm doing wrong. I received the following error as indicated in the title.
My input was as shown on this website: :
[Hugging Face - Ger-RAG-eval](https://huggingface.co/da…
-
i have the same problems with this issue ( https://github.com/EleutherAI/lm-evaluation-harness/issues/1347 )
i just want to eval gsm8k from local dataset folder, as the web in China can't access h…
-
Hello! I just follow the instructions on Readme for testing the codes on FE240hz Dataset. However, after doing all the right things, an error occurred and there is nowhere to fix it. Could you please …
-
hello, how does the evaluation metrics of Summe and TVSum datasets τ and ρ calculate? I only see the calculation method of fscore in the code.
Looking forward to getting your reply, thank you very…
-
# Taiwanese Hokkien Tone Recognition
This task aims to recognize "tones" in Taiwanese Hokkien. Taiwanese Hokkien is a tonal language with multiple tones that can change the meaning of words. Accura…
-
Hello Leland,
Thank you for sharing this new algorithm.
I have a question regarding evaluation measures of dimensionality reduction methods. I'm aware of trustworthiness and continuity, but I'm lo…