-
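I. Problem description:
Self-cognition training with LoRA completed normally. Running inference from the command line afterwards fails with "RuntimeError: cutlassF: no kernel found to launch!". Could you help identify the cause? Thank you very much!
II. Details:
1. Self-cognition training, finished normally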
python llm_sft.py \
--model_type qwen1half-0_5b-chat \
…
-
### Is there an existing issue / discussion for this?
- [X] I have searched the existing issues / discussions
### Is this issue answered in the FAQ? | Is there an existing…
-
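**This is my parameter configuration**
The job runs for a few minutes and then stops with an error. I checked GPU memory and it was not fully used. What could be the problem?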
CUDA_VISIBLE_DEVICES=0,1,2,3 \
swift sft \
--model_type qwen1half-7b-chat \
--sft_type lora \
--tuner_backend swift \
--dtype A…
-
Hi,
I came across this repository, played around with the notebooks a bit, and succeeded in running the model locally to perform inpainting on my own MIDI files.
I was wondering whe…
-
### Checklist
- [X] 1. I have searched related issues but cannot get the expected help.
- [ ] 2. The bug has not been fixed in the latest version.
- [ ] 3. Please note that if the bug-related issue y…
-
### 🐛 Describe the bug
Using MPS for BERT inference appears to produce about a 2x slowdown compared to the CPU. Here is code to reproduce the issue:
```python
# MPS Version
from transformers i…
-
**Describe the bug**
```bash
Train: 12%|████████████ | 944/7866 [51:45
-
This used to run, but after switching to a different server it fails with an error. The output is as follows:
[INFO:swift] Model file configuration.json is different from the latest version `v1.0.2`,This is because you are using an older version or the file is updated manually.
…
-
### System Info
```shell
Name: transformers
Version: 4.40.1
Name: optimum
Version: 1.19.1
```
At some point in time the following export changed from a single output with the name `last_…
-
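I am currently using four A100 40G GPUs for LoRA fine-tuning of InternVL-v1.5. Without deepspeed, GPU memory runs out even when all four cards are used. With the default-zero2 configuration given in the tutorial, I get the following errors: UserWarning: None of the inputs have requires_grad=True. Gradients will be None, and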
AttributeError: …