-
### System Info / 系統信息
Traceback (most recent call last):
File "/home/sa/swift/swift/cli/sft.py", line 5, in
sft_main()
File "/home/sa/swift/swift/utils/run_utils.py", line 32, in x_main
result =…
-
环境:4090*4,python=3.10.12,ubuntu
报错如下:
User: 请描述图片内容
Exception in thread Thread-7 (generate):
Traceback (most recent call last):
File "/home/user/anaconda3/envs/ms-swift/lib/python3.10/threading…
-
**Describe the bug**
When deploying LLaVA-NeXT-Video-34B-hf, I find that the configuration key passed to transformers is "llava_next_video", while the accurate key in tranformers is "llava-next-video…
-
In both `pip install git+https://github.com/alex-pinkus/tree-sitter-swift.git` and `poetry add git+https://github.com/alex-pinkus/tree-sitter-swift.git` installation fails on a building stage because …
-
Refer to https://swift.readthedocs.io/zh-cn/latest/Multi-Modal/qwen2-vl%E6%9C%80%E4%BD%B3%E5%AE%9E%E8%B7%B5.html
[rank0]: File "/usr/local/lib/python3.10/site-packages/transformers/trainer.py", …
-
So far I've ported the components I needed to support the models I tested, but there are many more in `transformers` and `tokenizers`. For example:
- https://github.com/huggingface/swift-transforme…
-
非常感谢您的工作!我在使用DPO训练全量微调后的InternVL2-8B模型遇到了如下问题:
下面是我的微调脚本:
```
CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 \
swift rlhf \
--rlhf_type dpo \
--model_type internvl2-8b \
--model_id_or_path…
bonre updated
11 hours ago
-
参考 https://github.com/modelscope/ms-swift/blob/main/docs/source/Multi-Modal/qwen2-vl%E6%9C%80%E4%BD%B3%E5%AE%9E%E8%B7%B5.md,在 **四个16G V100** 显卡主机上,搭建环境,测试单样本推理脚本时发现,仅单卡时可以正常运行。双卡,三卡和四卡时运行异常。
## 搭建环…
-
The Python transformers library downloads models to `~/.cache/huggingface/hub`. This allows reuse of previously downloaded models by multiple apps. Currently, calling `HubApi.shapshot` in swift-transf…
-
**Describe the bug**
### 使用以下命令部署微调后的模型
CUDA_VISIBLE_DEVICES=1 swift deploy --model_type qwen2-vl-7b-instruct --model_id_or_path /root/ms-swift/train/qwen2-vl-7b-instruct/v1-20240906-145640/checkp…