-
[ ] I checked the [documentation](https://docs.ragas.io/) and related resources and couldn't find an answer to my question.
**Your Question**
What is unclear to you? What would you like to know?
…
-
**BUG**
While testing DeepEval's GEval metric on complex queries, especially ones the LLM failed to answer, I ran into an issue where DeepEval is overlooking the provided expected_output, relies & …
-
Here's a simple example of how to perform A/B testing in Python using user likes and dislikes as feedback:
### Step-by-Step Guide
1. **Install Required Libraries**
```python
!pip install…
```
-
Hey!
I have tried my hand at RAGAS with a Watson LLM. The major issue I am facing is that I continuously get the warning:
"Failed to parse output. Returning None."
Is there any fix for this?
Is i…
-
Command: (xtuner-env) root@autodl-container-d293479255-f53de588:~/autodl-tmp/data# xtuner train sh/internlm2_5_chat_7b_qlora_oasst1_e3_copy.py --deepspeed deepspeed_zero2
Error message: 10/18 16:45:32 - mmengine - W…
-
### 🚀 The feature, motivation and pitch
***Please note that since the actual implementation is going to be simple, and the design has already been reviewed, the purpose of this GitHub Issue is to l…
-
### System Info
- CPU architecture: x64
- Libraries
- TensorRT-LLM: 0.14.0
- CUDA: 12.6, 12.4, 12.1
- OS: Windows 10
### Who can help?
@byshiue
### Information
- [x] The official example scr…
-
# URL
- https://arxiv.org/pdf/2408.02666
# Affiliations
- Tianlu Wang, N/A
- Ilia Kulikov, N/A
- Olga Golovneva, N/A
- Ping Yu, N/A
- Weizhe Yuan, N/A
- Jane Dwivedi-Yu, N/A
- Richard Yu…
-
Implement a pairwise evaluator that uses an LLM as a judge to compare two generations against each other. In the case of experiments, this would presumably mean judging against the expected>
ht…
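A minimal sketch of what such a pairwise evaluator could look like, not the project's actual design. The `Judge` callable, the `pairwise_evaluate` name, and the `"A"`/`"B"`/`"tie"` verdict format are all assumptions made for illustration; a real judge would wrap an LLM call. The two generations are shown to the judge in random order, a common way to reduce position bias, and the verdict is mapped back to the original labels.

```python
import random
from typing import Callable, Optional

# Hypothetical judge interface: receives (prompt, answer_a, answer_b) and
# returns "A", "B", or anything else to signal a tie / unparseable verdict.
Judge = Callable[[str, str, str], str]


def pairwise_evaluate(
    prompt: str,
    gen_1: str,
    gen_2: str,
    judge: Judge,
    rng: Optional[random.Random] = None,
) -> str:
    """Return "gen_1", "gen_2", or "tie" according to the judge."""
    rng = rng or random.Random()
    # Randomize which generation is shown in slot A to reduce position bias.
    swapped = rng.random() < 0.5
    answer_a, answer_b = (gen_2, gen_1) if swapped else (gen_1, gen_2)

    verdict = judge(prompt, answer_a, answer_b).strip().upper()
    if verdict not in {"A", "B"}:
        return "tie"

    # Map the slot the judge picked back to the original generation.
    picked_slot_a = verdict == "A"
    return "gen_1" if picked_slot_a != swapped else "gen_2"


def toy_length_judge(prompt: str, answer_a: str, answer_b: str) -> str:
    """Stand-in for an LLM call (assumption): prefers the longer answer."""
    if len(answer_a) == len(answer_b):
        return "tie"
    return "A" if len(answer_a) > len(answer_b) else "B"
```

For experiments, `gen_2` could be the expected output, so that the verdict says whether the new generation beats the reference.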
-
### Your current environment
Name: vllm
Version: 0.6.3.post2.dev171+g890ca360
### Model Input Dumps
_No response_
### 🐛 Describe the bug
I used the interface from this vllm repository …