-
### 软件环境
```Markdown
- paddlepaddle-gpu: 0.0.0.post120
- paddlenlp: 3.0.0b2
```
### 重复问题
- [X] I have searched the existing issues
### 错误描述
```Markdown
跑llama3-8b的sft微调时,报错
Traceback (most r…
-
Hello,
I was wondering whether there is any difference between Answer Relevance and Answer Faithfulness. Conceptually there is of course, but the code for training LLM judges and actually judging s…
-
Since there are many datasets in the format of Huggingface datasets, it would be convenient if `preprocess_data.py` can directly preprocess and tokenize from HF datasets.
-
### Reminder
- [X] I have read the README and searched the existing issues.
### System Info
when I just add one line in the `examples/extras/adam_mini/qwen2_full_sft.yaml` got a error below.
```…
-
For example https://huggingface.co/datasets/ontocord/CulturaY.
-
**Inference:**
```bash
CUDA_VISIBLE_DEVICES=0 swift infer --model_type got-ocr2 --model_id_or_path stepfun-ai/GOT-OCR2_0
```
```
-
**Is your feature request related to a problem? Please describe.**
I need to use CCL to send neural network weights from one device to another without using host.
Also we need to have all_reduce sup…
-
**Describe the bug**
This bug is similar to #4055 , I provide a repro here.
**To Reproduce**
Please put these three files in the same directory (remember to change the first two `.txt -> .py` and…
-
### Checklist
- [ ] 1. I have searched related issues but cannot get the expected help.
- [ ] 2. The bug has not been fixed in the latest version.
- [ ] 3. Please note that if the bug-related issue y…
-
# URL
- https://arxiv.org/abs/2404.01869
# Authors
- Philipp Mondorf
- Barbara Plank
# Abstract
- Large language models (LLMs) have recently shown impressive performance on tasks involving reaso…