llm-evaluation Search Results

1000+ results
for llm-evaluation

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

BoundaryML/baml #756

Few-shot examples support?

Thanks for this refreshing take on LLM prompt generation and evaluation, it's very promising. I was wondering if few-shot examples should have their own first-class support in BAML due to their pow…

mbbyn updated 3 days ago
3
unaheidi/unaheidi.github.io #32

大模型阅读清单

LLM 大模型学习必知必会系列 (一)：大模型基础知识篇 https://xie.infoq.cn/article/4a3cc4bb786ad63e31414c466?utm_campaign=geektime_search&utm_content=geektime_search&utm_medium=geektime_search&utm_source=geektime_search&utm_t…

unaheidi updated 1 month ago
1
OSU-NLP-Group/LLM-Planner #18

running run_eval.py gives error

hey all , I am trying to run the evaluation file but it is giving the following errors. ``` (alfworld) srinjoym@user:~/LLM-Planner/src$ python run_eval.py --config gpt4_base_config.yaml Traceback …

Acejoy updated 3 weeks ago
1
explodinggradients/ragas #1073

ExceptionInRunner: The runner thread which was running the j…

[ ] I have checked the [documentation](https://docs.ragas.io/) and related resources and couldn't resolve my bug. **Describe the bug** Can't get evaluation to work. constantly get the error: The r…

edubrigham updated 1 hour ago
4
huu4ontocord/MDEL #37

Integrate with LLM evaluation frameworks

Integrate MDEL with various evaluation framework - [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness) - [helm](https://github.com/stanford-crfm/helm)

kenhktsui updated 1 year ago
3
RLHFlow/Online-RLHF #6

Model evaluation issue

Hi, I am trying to evaluate the model RLHFlow/LLaMA3-iterative-DPO-final with MT Bench. I use the inference environment in ReadME and follow the scripts from https://github.com/lm-sys/FastChat/tree/ma…

matouk98 updated 1 month ago
5
vllm-project/vllm #4904

[Bug]: llm_engine_example.py (more requests) get stuck

### Your current environment ```text Collecting environment information... PyTorch version: 2.3.0+cu121 …

CsRic updated 1 month ago
1
acl-org/acl-anthology #3177

Updating Paper Metadata for 2024.eacl-demo.23

### Confirm that this is a metadata correction - [X] I want to file corrections to make the metadata match the PDF file hosted on the ACL Anthology. ### Anthology ID 2024.eacl-demo.23 ### Type of …

firojalam updated 4 days ago
4
Scale3-Labs/langtrace-python-sdk #230

Issue passing the user feedback

Hello everyone, I am trying to implement the trace user feedback. And this seems to be working well (the endpoint returns a 200 code response). However, I don't see the span/traces on the online d…

bukosabino updated 3 days ago
8
EleutherAI/lm-evaluation-harness #1688

IndexError on BBH tasks

Across a few models and a few BBH tasks, I obtain this error: ``` match = [m for m in match if m][0] IndexError: list index out of range ``` The full stack trace is below: ``` $ lm_ev…

RylanSchaeffer updated 2 months ago
2

上一页 1...5 6 7 8 9 10 11...100 下一页

1000+ results for llm-evaluation

1000+ results
for llm-evaluation