-
[ ] I checked the [documentation](https://docs.ragas.io/) and related resources and couldn't find an answer to my question.
**Your Question**
My dataset here:
{'question': 'what are you going', '…
-
Hi, thanks again for this cool work!
Where can I locate the random 1k (image, long text) pairs separated from ShareGPT4V for long-caption image-text retrieval evaluation? Can you release this data …
-
Based on info in https://github.com/US-EPA-CAMD/easey-ui/issues/6205.
During initial queueing:
- There is currently a popup indicating that queueing of a file was successful. We need to add a new mes…
-
Currently, an evaluation's results page can be viewed as "Public" (shows results that are visible for everyone logged in), "Own" (shows all results that are visible for the current user), and "Export"…
-
I run the script `python stage1.py --root 'data/kv_data' --gtm --lm --devices '[0]' --filename pcdes_evaluation --init_checkpoint "all_checkpoints/share/stage1.ckpt" --rerank_cand_num 128 --num_query…
-
About Hacktoberfest contributions: https://github.com/evidentlyai/evidently/wiki/Hacktoberfest-2024
**Description**
The ROUGE (Recall-Oriented Understudy for Gisting Evaluation) metric evaluates…
-
I ran the evaluation plan from here https://github.com/ErikssonJ/Probenecid-Model/tree/main
And the names of the processes are too small (actually, smaller then the following text), so it is really…
-
Hi, thanks for sharing your code!
I have a question regarding scoring F1/EM on MultimodalQA. Their answer annotation (e.g., `MMQA_dev.jsonl`) has multiple GT answers, but [the evaluation code in th…
j-min updated
4 weeks ago
-
### Question Validation
- [X] I have searched both the documentation and discord for an answer.
### Question
I designed a chatbot with an Agent to perform a series of actions.
My agent works like…
-
# Prepare Dataset for AccentOptimizer
**Description:**
We need to prepare a suitable dataset for training the pronunciation evaluation machine learning model. This dataset should include audio s…