-
使用ms-swift版本为2.6.0.dev0,transformers库为4.45.2时报错
[rank1]: Traceback (most recent call last):
[rank1]: File "/home/xxx/anaconda3/envs/f_got/lib/python3.10/site-packages/transformers/models/auto/conf…
-
在使用ms-swift微调时,由于swift所使用的版本与模型本身所使用的版本不相同,会导致报错:
ImportError: cannot import name 'log' from 'torch.distributed.elastic.agent.server.api'.
在安装ms-swift库时就有报错:
ERROR: pip's dependency resolver does n…
-
```
Model loaded -> id("mlx-community/phi-2-hf-4bit-mlx")
Error: chatTemplate("No chat template was specified")
```
For models that have a chat template this is fine, but for those that do not:
…
-
So far I've ported the components I needed to support the models I tested, but there are many more in `transformers` and `tokenizers`. For example:
- https://github.com/huggingface/swift-transforme…
-
## 环境信息
- GPU:A100
- 显存:40G
- SWIFT版本:v2.5.2
## 训练脚本
```
CUDA_VISIBLE_DEVICES=0 PYTORCH_CUDA_ALLOC_CONF="expandable_segments:True" swift sft \
--model_type llama3_2-11b-vision-instruct …
-
pkg_resources.DistributionNotFound: The 'transformers=4.33' distribution was not found and is required by ms-swift
我用swift微调qwen2vl运行报了这个版本错误,但是我是用指南中的pip install git+https://github.com/huggingface/t…
-
We can use one of the snapshot download functions from https://github.com/huggingface/swift-transformers/blob/71963c36da21b29630ee43fa0d748f8f5b88fc33/Sources/Hub/HubApi.swift#L185
-
```
使用环境:
torch==2.4
transformers==4.45.dev0
torchvision==0.19.0
4*V100
NVIDIA-SMI 535.154.05
Driver Version: 535.154.05
CUDA Version: 12.2
微调命令:
CUDA_VISIBLE_DEVICES=0,1,…
-
I am attempting to use the fine tuning with my custom dataset, however the training percentage value keeps staying at 0% and not increasing at all, after 20h of running time:
```
Train: 0%| …
-
**Describe the bug**
When using Flash Attention (--use-flash-attention true) to train Qwen2VL model with mixed data (both image and text data), the code will yield the following error
```
[rank0]: …