-
### Is there an existing issue for the same bug?
- [X] I have checked the troubleshooting document at https://docs.all-hands.dev/modules/usage/troubleshooting
- [X] I have checked the existing iss…
-
## Description
Metrics make or break an evaluation framework, so it's important to choose metrics that align with the framework's overall scope and goals.
There are two major types of metrics that will be used:
…
-
Hi Guangzhi,
Thank you for your great work!
Could you share the code you use to calculate generation accuracy? Do you use substring matching to check whether the ground-truth answer occurs in the gener…
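For reference, substring-match accuracy can be sketched as below. This is a minimal assumption about the metric, not necessarily the authors' implementation (which may normalize text differently or handle multiple reference answers):

```python
def generation_accuracy(predictions, ground_truths):
    """Fraction of generations that contain their ground-truth answer.

    Assumes a simple case-insensitive substring match; the actual
    matching rule in the paper's code may differ.
    """
    if not predictions:
        return 0.0
    hits = sum(
        truth.strip().lower() in pred.lower()
        for pred, truth in zip(predictions, ground_truths)
    )
    return hits / len(predictions)


preds = ["The capital of France is Paris.", "I am not sure."]
golds = ["Paris", "42"]
print(generation_accuracy(preds, golds))  # 0.5
```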
-
Thank you for your contributions!
I was wondering whether it is possible, and if so how, to use the HHEM model to evaluate an LLM after fine-tuning it on our specific dataset?
-
Estimate key LLM metrics:
- Overall quality score, accuracy
- Hallucination rate (hallucination detection)
- Relevancy
- Coherence
- Responsible AI violations
- Safety
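Per-response results for the metrics above can be rolled up into corpus-level numbers; a minimal sketch (the metric names, weights, and aggregation scheme are illustrative assumptions, not a prescribed formula):

```python
def hallucination_rate(flags):
    """Share of responses flagged as hallucinated.

    `flags` is a list of booleans, one per response, from a
    hallucination detector (illustrative input format).
    """
    return sum(flags) / len(flags) if flags else 0.0


def overall_quality(scores, weights):
    """Weighted average of per-metric scores in [0, 1].

    The metric set and weights are placeholders; a real framework
    defines its own.
    """
    total = sum(weights.values())
    return sum(scores[m] * w for m, w in weights.items()) / total


flags = [False, True, False, False]
print(hallucination_rate(flags))  # 0.25

scores = {"relevancy": 0.9, "coherence": 0.8, "safety": 1.0}
weights = {"relevancy": 2, "coherence": 1, "safety": 1}
print(overall_quality(scores, weights))  # 0.9
```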
-
Hello!
This is very nice work!
I would like to know how you run the automatic safety evaluations. According to the paper, you use a safety-critique LLM for the evaluations. Will you release the…
-
- [ ] I checked the [documentation](https://docs.ragas.io/) and related resources and couldn't find an answer to my question.
**Your Question**
> “WARNING:ragas.llms.output_parser:Failed to parse …
-
So I'm trying to evaluate an LLM response ad hoc.
I have multiple asserts like:
A: Check enum is in results for "Input A" in prompt
B: Check result is sql for Input B
C: Check there is LI…
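Checks A and B above can be expressed as plain assertion functions; this is a sketch under stated assumptions (the allowed enum values and the naive SQL heuristic are hypothetical, and check C is omitted because its description is truncated):

```python
# Hypothetical enum expected in the response to "Input A".
ALLOWED_ENUM = {"LOW", "MEDIUM", "HIGH"}


def check_enum(response: str) -> bool:
    """A: the response must contain one of the allowed enum values."""
    return any(value in response for value in ALLOWED_ENUM)


def check_is_sql(response: str) -> bool:
    """B: crude heuristic that the response looks like a SQL statement."""
    keywords = ("SELECT", "INSERT", "UPDATE", "DELETE")
    return response.strip().upper().startswith(keywords)


response_a = "Risk level: HIGH"
response_b = "SELECT id FROM users WHERE active = 1;"
assert check_enum(response_a)
assert check_is_sql(response_b)
print("ad-hoc checks passed")
```

Keeping each check as a small pure function makes it easy to run them individually against a single response or batch them into a table of pass/fail results.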
-
Hello,
I would like to use your LiveBench, but I am primarily interested in testing the impact of different prompts on certain tasks, rather than testing the models themselves. My plan is to write …
-
# Summary
## Create a Wasm-based LLM app for financial analysts
### Description
We would like to develop an LLM-based financial data analytics application using open source LLMs, embedding mo…