rag-evaluation Search Results

612 results
for rag-evaluation

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

Giskard-AI/giskard #1872

generate_test does not work with Azure Open AI key

### Issue Type Bug ### Source source ### Giskard Library Version 2.8.0 ### Giskard Hub Version NaN ### OS Platform and Distribution Ubuntu 22.04.4 LTS ### Python version …

terry07 updated 5 months ago
2
microsoft/semantic-kernel #4606

Python: Context Result is the same for multiple inquiries.

**Describe the bug** I am running 4 inquiries to get llm response by asyncio.gather(). And output the response into a dataset for RAGAS evaluation. But the results are all the same from Context.Re…

HuskyDanny updated 7 months ago
1
dna-ey-fso/azure-production-rag #13

Run batch evaluate script within the back-end of the rag app

sofyan-ajridi-ey updated 7 months ago
1
kubeagi/arcadia #286

Evaluations

## Overall workflow **TO BE DEFINED** ## Evaluation Types ### RAG Evaluation @Lanture1064 @bjwswang Our current RAG solution flow : 1. Dataset/VersionedDataset provides source file…

bjwswang updated 8 months ago
5
huggingface/text-generation-inference #1776

One of two concurrent request generating empty text (Mistral…

### System Info Running TGI docker with command `docker run --rm --gpus all --ipc=host -p 8080:80 -v /root/.cache/huggingface/hub:/data -e HF_API_TOKEN=hf_XXXX ghcr.io/huggingface/text-generatio…

TysonHeart updated 3 months ago
2
run-llama/llama_index #11308

No score in Trulens dashboard, log error unable to evaluate

### Bug Description My basic idea is to build an automated LLM evaluation program using llama index and Trulens. The LLM I used is Chat GLM, which has the same streaming API calling method as Chat GP…

Scottie-tech updated 7 months ago
1
AlibabaResearch/DAMO-ConvAI #39

Panel of BIRD Annotation Issues.

Hi all, Although `BIRD` has incurred significant annotation costs, we still cannot guarantee that all the data is accurately labeled. **_We hope that the community can assist us in building BIRD t…

huybery updated 1 week ago
35
explodinggradients/ragas #680

answer_relevancy

**Your Question** what model is used by ragas for answer relevancy score calculation ? I didn't mentioned any model name or inference API till ragas generating an evaluation report ? **Additio…

Prashant0520 updated 7 months ago
1
stanfordnlp/dspy #568

Error when running Evaluate: 'str' object has no attribute …

I am trying out DSPy 2.3.6 with a simple transcript summarization example: ```python class Summarizer(dspy.Module): def __init__(self): super().__init__() self.summarizer = …

gramster updated 6 months ago
13
truera/trulens #863

Empty board from tru.get_leaderboard(app_ids=["RAG v1"])

I am currently using the code provided in this [Colab notebook](https://colab.research.google.com/github/truera/trulens/blob/main/trulens_eval/examples/quickstart/quickstart.ipynb#scrollTo=-HyRuVA2qR7…

kouskouss updated 7 months ago
3

上一页 1...40 41 42 43 44 45 46...62 下一页

612 results for rag-evaluation

612 results
for rag-evaluation