-
**Describe the Feature**
I think it could be useful to support multiple evaluator models and average the scores across them to reduce the bias of any single judge.
**Why is the feature important for you?**
It seems l…
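A rough sketch of the averaging I have in mind (plain Python; the judge names, metric names, and scores below are placeholders, not an existing API):

```python
from statistics import mean

# Placeholder scores from several evaluator ("judge") models; the judge and
# metric names here are illustrative only.
scores_by_judge = {
    "judge-a": {"faithfulness": 0.82, "answer_relevancy": 0.90},
    "judge-b": {"faithfulness": 0.76, "answer_relevancy": 0.88},
    "judge-c": {"faithfulness": 0.80, "answer_relevancy": 0.85},
}

# Average each metric across judges to smooth out single-model bias.
metric_names = next(iter(scores_by_judge.values())).keys()
averaged = {
    m: mean(scores[m] for scores in scores_by_judge.values())
    for m in metric_names
}
print(averaged)
```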
-
## Question regarding handling special tokens in conversation transcription
First of all, thanks for making this wonderful SDK to easily create voice-enabled applications!
I'm currently buildi…
-
Please note our paper on evaluation, which could be an important building block for multilingual evaluation and cultural understanding.
[SeaEval for Multilingual Foundation Models: From Cross-Lingu…
-
JudgeBench: A Benchmark for Evaluating LLM-based Judges
https://arxiv.org/abs/2410.12784
-
### Describe the issue
@pzs19
I would like to reproduce and expand the end2end latency benchmark results of the LLMLingua-2 paper and was therefore wondering if you could provide more details on yo…
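For reference, this is the kind of end-to-end timing loop I am using so far (a sketch: the compressor setup follows the LLMLingua-2 README examples, while `call_llm` and the `rate` value are placeholders I picked myself and may differ from the paper's setup):

```python
import time

from llmlingua import PromptCompressor

# Compressor setup as in the LLMLingua-2 README; the rate below is my own choice.
compressor = PromptCompressor(
    model_name="microsoft/llmlingua-2-xlm-roberta-large-meetingbank",
    use_llmlingua2=True,
)

def call_llm(prompt: str) -> str:
    # Placeholder for the downstream model request (API call or local model).
    return ""

def run_once(prompt: str, question: str) -> float:
    """Measure one end-to-end pass: compression + downstream generation."""
    start = time.perf_counter()
    compressed = compressor.compress_prompt(prompt, rate=0.33)
    call_llm(compressed["compressed_prompt"] + "\n" + question)
    return time.perf_counter() - start
```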
-
1. Is the KV cache actually **not used** in all the LLM-evaluation tasks, since those tasks usually take **only one-step** attention calculation, unlike the language-generation process, which needs a lot of…
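To make the question concrete, here is a minimal sketch with Hugging Face `transformers` (`gpt2` is just a small example model): a likelihood-style evaluation needs only one forward pass over the full sequence, whereas generation produces tokens step by step and benefits from the cache.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
inputs = tokenizer("The capital of France is Paris.", return_tensors="pt")

# Likelihood-style evaluation: the whole sequence is scored in a single
# forward pass, so the KV cache offers no speedup and can be disabled.
with torch.no_grad():
    out = model(**inputs, use_cache=False)
log_probs = out.logits.log_softmax(dim=-1)

# Generation: every new token attends to all previous ones, so the cache
# (on by default) avoids recomputing past keys/values at each step.
generated = model.generate(**inputs, max_new_tokens=20, use_cache=True)
print(tokenizer.decode(generated[0]))
```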
-
- [ ] I checked the [documentation](https://docs.ragas.io/) and related resources and couldn't find an answer to my question.
**Your Question**
What is the use of docstore in TestsetGenerator? How i…
-
- This issue focuses on the technical courses we take about LLMs; we'll put the paper part in
https://github.com/xp1632/DFKI_working_log/issues/70
---
1. **ChainForge** https://chainforge.ai/ …
-
**Describe the bug**
I want to use local LLMs to evaluate my RAG app. I have tried Ollama and HuggingFace models, but neither of them is working.
Ragas version: 0.1.11
Python version: 3.11.3
**…
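For reference, this is roughly what I am attempting (a sketch assuming that `evaluate()` in ragas 0.1.x accepts LangChain LLMs/embeddings directly and wraps them internally; the Ollama model names are only examples):

```python
from datasets import Dataset
from langchain_community.chat_models import ChatOllama
from langchain_community.embeddings import OllamaEmbeddings

from ragas import evaluate
from ragas.metrics import answer_relevancy, faithfulness

# Local judge LLM and embeddings served by Ollama (model names are examples).
llm = ChatOllama(model="llama3")
embeddings = OllamaEmbeddings(model="nomic-embed-text")

# Minimal sample in the column layout ragas expects.
dataset = Dataset.from_dict({
    "question": ["What is the capital of France?"],
    "answer": ["Paris is the capital of France."],
    "contexts": [["Paris is the capital and largest city of France."]],
    "ground_truth": ["Paris"],
})

result = evaluate(
    dataset,
    metrics=[faithfulness, answer_relevancy],
    llm=llm,                # LangChain LLM, assumed to be wrapped by ragas
    embeddings=embeddings,
)
print(result)
```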
-
### Feature Description
The most popular LLM providers, such as OpenAI, support candidate generations, i.e., generating n responses for the same prompt. This feature can be used in RAG, evaluations, and mo…
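For context, a minimal sketch of the underlying behavior via the `n` parameter of the OpenAI Chat Completions API (the model name is only an example):

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Request n candidate completions for the same prompt in a single call.
response = client.chat.completions.create(
    model="gpt-4o-mini",  # example model name
    messages=[{"role": "user", "content": "Name one use case for RAG."}],
    n=3,
)

# Each candidate comes back as a separate choice.
for i, choice in enumerate(response.choices):
    print(f"candidate {i}: {choice.message.content}")
```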