-
Whenever I follow the installation instructions, apply autogptq to llama2-hf, then try to run qalora.py on the checkpoints produced by autogptq. I install autogptq using pip install autogptq[triton], …
-
### 🐛 Describe the bug
i use pytorch==2.3.0 and peft to train llama3 8b , when i run my code, its raise error like:
```text
torch._amp_foreach_non_finite_check_and_unscale_(
RuntimeError:…
-
To better plan and have an overview of the tasks still open to implementing the Pushshift Reddit benchmark I ask you to go through the following functions with regard to the Pushshift Reddit benchmark…
-
在32G 的卡上跑,clip 默认设置的200。 batch-size=1都会溢出。也是服了。
然后发现preprocess里面的clip 其实没有clip掉。最长的句子竟然会出现1146个char 溢出应该也正常吧。。。
修改了代码,贴上来,有遇到同样问题的人可以参考一下。
```
import argparse
import numpy as np
import os
imp…
-
I would like to ask for your advice on the following two questions.
1. DPO train does not seem to support DeepSpeed ZeRO. After manually integrating `DPOAlignerArguments` with the `FinetunerArguments…
-
Hi,
I encountered a problem when using your code.
here is the command I used:
```
python script_bin/train_model.py \
--trainer --train-inputs cnn-dailymail/inputs/train \
--train-labels …
-
### Reminder
- [X] I have read the README and searched the existing issues.
### System Info
[2024-06-07 10:17:14,980] [INFO] [real_accelerator.py:191:get_accelerator] Setting ds_accelerator t…
-
Hi, I have installed ipex-llm follow the docs: https://github.com/intel-analytics/ipex-llm/tree/main/python/llm/example/CPU/QLoRA-FineTuning
and i meet the error
```
found intel-openmp in /root/…
-
when I try to use this repo to run the AMI dataset , I got this error ,can you help me?
My command is:
`python train_model.py --trainer --train-inputs /home/workspace/IdeaProjects/word2sentExtra…
-
**Describe the bug**
I\m training a seq2seq transformer model in librispeech_100 egs2 dir.
I ahve set ctc_weight =0.0 to disable ctc in model training and I expect that to hold for decoding too.
…