evaluate-llm Search Results

1000+ results
for evaluate-llm

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

explodinggradients/ragas #1540

How to check my embedding llama3 model is running for evalua…

[ ] I checked the [documentation](https://docs.ragas.io/) and related resources and couldn't find an answer to my question. **Your Question** what is unclear to you? What would you like to know? …

RyanTree-HS updated 1 month ago
2
confident-ai/deepeval #1149

GEval not focusing on expected_output & Relying on OpenAI in…

**BUG** While testing DeepEval's GEval metric to evaluate complex queries, especially where LLMs failed to answer, I faced an issue where DeepEval is overseeing the provided expected_output relies & …

pavan-growexxer updated 1 week ago
2
NeroCube/bookmark #442

How to do the a/b testing for LLM base on user like or unlik…

Here's a simple example of how to perform A/B testing in Python using user likes and dislikes as feedback: ### Step-by-Step Guide 1. **Install Required Libraries** ```python !pip install…

NeroCube updated 4 days ago
2
explodinggradients/ragas #1186

Evaluation with IBM WatsonX LLM's

Hey! I have tried my hands on RAGAS with Watson LLM, the major issue I am facing is getting the warning: "Failed to parse output. Returning None." Continuously. Is there any fix for this? Is i…

swayam-khandelwal updated 3 months ago
1
InternLM/xtuner #952

xtuner 微调internLM2.5出错

命令：(xtuner-env) root@autodl-container-d293479255-f53de588:~/autodl-tmp/data# xtuner train sh/internlm2_5_chat_7b_qlora_oasst1_e3_copy.py --deepspeed deepspeed_zero2 报错信息：10/18 16:45:32 - mmengine - W…

sakura073 updated 1 day ago
9
pytorch/torchchat #1334

Multimodal Eval Enablement (Looking for Developer to Impleme…

### 🚀 The feature, motivation and pitch ***Please note that since the actual implementation is going to be simple, and the design has already been reviewed, the purpose of this GitHub Issue is to l…

Olivia-liu updated 1 week ago
13
NVIDIA/TensorRT-LLM #2489

Issues with installing on Windows

### System Info - CPU architecture: x64 - Libraries - TensorRT-LLM: 0.14.0 - CUDA: 12.6, 12.4, 12.1 - OS: Windows 10 ### Who can help? @byshiue ### Information - [x] The official example scr…

PyroGenesis updated 1 day ago
2
AkihikoWatanabe/paper_notes #1464

Self-Taught Evaluators, Tianlu Wang+, N/A, arXiv'24

# URL - https://arxiv.org/pdf/2408.02666 # Affiliations - Tianlu Wang, N/A - Ilia Kulikov, N/A - Olga Golovneva, N/A - Ping Yu, N/A - Weizhe Yuan, N/A - Jane Dwivedi-Yu, N/A - Richard Yu…

AkihikoWatanabe updated 2 weeks ago
1
Arize-ai/phoenix #3738

[experiments] pairwise evaluator

Implement a pairwise evaluator that leverages LLM as a judge to judge two generations against each-other. In the case of experiments this would assume to perform judgement against the expected> ht…

mikeldking updated 1 day ago
1
vllm-project/vllm #9875

[Bug]: Running on a single machine with multiple GPUs error

### Your current environment Name: vllm Version: 0.6.3.post2.dev171+g890ca360 ### Model Input Dumps _No response_ ### 🐛 Describe the bug I used the interface from this vllm repository …

Wiselnn570 updated 3 weeks ago
6

上一页 1...3 4 5 6 7 8 9...100 下一页

1000+ results for evaluate-llm

1000+ results
for evaluate-llm