-
[ ] I have checked the [documentation](https://docs.ragas.io/) and related resources and couldn't resolve my bug.
**Describe the bug**
Further request for LLamaIndex support regarding Azure OpenAI…
-
- [ ] [[2303.16634] G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment](https://arxiv.org/abs/2303.16634)
# [2303.16634] G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment
…
-
[ ] I checked the [documentation](https://docs.ragas.io/) and related resources and couldn't find an answer to my question.
** Facing error with using Langchain wrapped hugging face models**
I am …
-
Thank you very much for doing such great open-source work!
i try:
CUDA_VISIBLE_DEVICES=X bash scripts/evaluate.sh PATH_OR_NAME_TO_BASE_MODEL PATH_TO_SAVE_TUNE_MODEL PATH_TO_PRUNE_MODEL EPOCHS_YOU…
-
### Feature Description
The most popular LLMs such as OpenAI support candidate generations which means to generate n responses for the same prompt. This feature can be used in RAG, evaluations and mo…
-
# Overview
llm-jp-eval 1.4.1を各種モデルで実施するための統合実験。
# Details
## 実験の実施手順
1. 評価を行いたいモデルのHugging Face形式チェックポイントを用意してください。
1. チェックポイントのパスと評価タスク名を本issueのコメントとして投下してください。
1. @odashi がsakura側で評価実験…
-
We’re so happy to have you on board with the LADy project, Calder! We use the issue pages for many purposes, but we really enjoy noting good articles and our findings on every aspect of the project.
…
-
Hello 👋
First of all thank you for the great work and evaluation results!
I have understood that in many cases you predicted outputs for each question based on the choice that minimizes the loss…
-
1. Is kv-cache actually **not used** in all the LLM-evaluation tasks, since those tasks usually takes **only one-step** attention calculation, not like language generating process which needs a lot of…
-
(xtuner) root@d6d9f5d36abe:~/model/InternVL_2_2b_safetensors# xtuner train ./internvl_v2_internlm2_2b_qlora_finetune_copy.py
The installed version of bitsandbytes was compiled without GPU support. 8-…