-
[ ] I checked the [documentation](https://docs.ragas.io/) and related resources and couldn't find an answer to my question.
**Your Question**
What is unclear to you? What would you like to know?
…
-
[ ] I have checked the [documentation](https://docs.ragas.io/) and related resources and couldn't resolve my bug.
**Describe the bug**
I am trying to generate test data for ragas evaluation; I hav…
-
### Question Validation
- [X] I have searched both the documentation and discord for an answer.
### Question
When evaluating a RAG retrieval service using the llama-index evaluation method, I encou…
-
Thank you for your excellent work!
In the current implementation of `qa_evaluate`, the `has_intersection` function used for comparing predicted answers with gold standard answers splits strings on …
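For context, a minimal sketch of what such a whitespace-based overlap check might look like (the function name matches the issue; the body is a hypothetical reconstruction, not the actual `qa_evaluate` implementation):

```python
def has_intersection(prediction: str, gold: str) -> bool:
    # Hypothetical reconstruction: split both strings on whitespace and
    # check whether any token overlaps. Note that this breaks down for
    # languages written without spaces (e.g. Chinese or Japanese).
    return bool(set(prediction.split()) & set(gold.split()))
```

With this scheme, `has_intersection("the capital is Paris", "Paris, France")` would be `False` despite the shared answer, because `"Paris"` and `"Paris,"` are distinct tokens after a plain whitespace split.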
-
This is a thread for Carolina to summarize her research on RAG.
The purpose is to share the information among project members.
-
### Bug Description
Following the LATS agent tutorial (https://docs.llamaindex.ai/en/stable/examples/agent/lats_agent/) results in a RuntimeError: Event loop is closed.
### Version
0.11.12
##…
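For anyone hitting the same error: "Event loop is closed" generally means a coroutine was driven on an event loop that has already been torn down. A minimal sketch of the usual pattern, assuming the tutorial's agent exposes an async entry point (`run_agent` below is a placeholder, not llama-index API):

```python
import asyncio

async def run_agent() -> str:
    # Placeholder for the tutorial's async agent call,
    # e.g. an awaited agent invocation in the LATS example.
    return "done"

# asyncio.run() creates a fresh event loop, runs the coroutine, and closes
# the loop afterwards. Reusing a loop after it has been closed (e.g. calling
# loop.run_until_complete() again) raises "RuntimeError: Event loop is closed".
result = asyncio.run(run_agent())
```

In a notebook, where an event loop is already running, the common workaround is to allow nested loops (e.g. via the `nest_asyncio` package) rather than calling `asyncio.run()` directly.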
-
Hello 👋
First of all thank you for the great work and evaluation results!
I understand that in many cases you predicted outputs for each question based on the choice that minimizes the loss…
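To make sure I follow: by "the choice that minimizes the loss" I mean the standard multiple-choice scoring scheme, sketched below with made-up per-choice losses (the numbers are illustrative, not taken from your results):

```python
# Hypothetical per-choice losses, e.g. the (length-normalized) negative
# log-likelihood of each answer continuation under the model; lower is better.
choice_loss = {"A": 2.31, "B": 1.87, "C": 2.95}

# Predict the choice whose continuation the model finds most likely,
# i.e. the one with the minimum loss.
prediction = min(choice_loss, key=choice_loss.get)
```

Here choice "B" would be predicted, since 1.87 is the smallest loss.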
-
Lastly, I was looking at the `YAML` files for the **UnifyAI** tools, and I had a few ideas that might help:
**Here are my thoughts 🤔**:
- `evaluate_llm_tool.yaml:`
It seems like the `prompt…
-
G-Eval includes "Auto Chain-of-Thoughts for NLG Evaluation" as a component in which the CoT steps for carrying out evaluation are produced by an LLM. Neither the paper nor this repo, however, includes the prompt defi…
-
Estimate key LLM metrics:
- Overall quality score, accuracy
- Hallucination rate (hallucination detection)
- Relevancy
- Coherence
- Responsible AI violations
- Safety
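As a concrete illustration of one of the metrics above, hallucination rate can be read as the fraction of answers flagged by a hallucination detector. A minimal sketch, with made-up per-answer flags:

```python
# Hypothetical per-answer hallucination flags from some detector
# (True = the answer was flagged as hallucinated).
flags = [False, True, False, False]

# Hallucination rate = flagged answers / total answers.
hallucination_rate = sum(flags) / len(flags)
```

With one flagged answer out of four, the rate here is 0.25; the other list-based metrics (relevancy, coherence, safety) can be aggregated the same way from per-answer scores.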