evaluate-llm Search Results

1000+ results
for evaluate-llm

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

explodinggradients/ragas #1278

Error w/ evaluate function using LLamaIndex AzureOpenAI mode…

[ ] I have checked the [documentation](https://docs.ragas.io/) and related resources and couldn't resolve my bug. **Describe the bug** Further request for LLamaIndex support regarding Azure OpenAI…

sam-h-long updated 1 month ago
6
irthomasthomas/undecidability #901

[2303.16634] G-Eval: NLG Evaluation using GPT-4 with Better …

- [ ] [[2303.16634] G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment](https://arxiv.org/abs/2303.16634) # [2303.16634] G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment …

ShellLM updated 2 months ago
1
explodinggradients/ragas #1247

Facing error for evaluate() for Langchain instance LLM and E…

[ ] I checked the [documentation](https://docs.ragas.io/) and related resources and couldn't find an answer to my question. ** Facing error with using Langchain wrapped hugging face models** I am …

kartik-angadi updated 1 month ago
1
horseee/LLM-Pruner #58

Evaluation：UnicodeDecodeError: 'utf-8' codec can't decode by…

Thank you very much for doing such great open-source work! i try: CUDA_VISIBLE_DEVICES=X bash scripts/evaluate.sh PATH_OR_NAME_TO_BASE_MODEL PATH_TO_SAVE_TUNE_MODEL PATH_TO_PRUNE_MODEL EPOCHS_YOU…

manlenzzz updated 1 month ago
1
run-llama/llama_index #16166

[Feature Request]: Support candidate generations

### Feature Description The most popular LLMs such as OpenAI support candidate generations which means to generate n responses for the same prompt. This feature can be used in RAG, evaluations and mo…

TupleType updated 1 month ago
1
llm-jp/experiments #60

[評価] - llm-jp-eval 1.4.1による統合評価

# Overview llm-jp-eval 1.4.1を各種モデルで実施するための統合実験。 # Details ## 実験の実施手順 1. 評価を行いたいモデルのHugging Face形式チェックポイントを用意してください。 1. チェックポイントのパスと評価タスク名を本issueのコメントとして投下してください。 1. @odashi がsakura側で評価実験…

odashi updated 2 weeks ago
1
fani-lab/LADy #96

Dataset Creation using LLMs!

We’re so happy to have you on board with the LADy project, Calder! We use the issue pages for many purposes, but we really enjoy noting good articles and our findings on every aspect of the project. …

Sepideh-Ahmadian updated 1 day ago
8
AILab-CVC/SEED-Bench #12

VLMs vs LLMs evaluation

Hello 👋 First of all thank you for the great work and evaluation results! I have understood that in many cases you predicted outputs for each question based on the choice that minimizes the loss…

idan-tankel updated 10 months ago
1
mit-han-lab/smoothquant #69

general question about SmoothQuant kv-cache quantization

1. Is kv-cache actually **not used** in all the LLM-evaluation tasks, since those tasks usually takes **only one-step** attention calculation, not like language generating process which needs a lot of…

brisker updated 5 days ago
1
InternLM/xtuner #937

docker利用xtuner微调时，出错，不知道哪的问题？

(xtuner) root@d6d9f5d36abe:~/model/InternVL_2_2b_safetensors# xtuner train ./internvl_v2_internlm2_2b_qlora_finetune_copy.py The installed version of bitsandbytes was compiled without GPU support. 8-…

159357hou updated 1 week ago
2

上一页 1...8 9 10 11 12 13 14...100 下一页

1000+ results for evaluate-llm

1000+ results
for evaluate-llm