swift-transformers Search Results

799 results
for swift-transformers

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

Ucas-HaoranWei/GOT-OCR2.0 #155

使用ms-swift微调时，transformers库无法正确加载模型。

使用ms-swift版本为2.6.0.dev0，transformers库为4.45.2时报错 [rank1]: Traceback (most recent call last): [rank1]: File "/home/xxx/anaconda3/envs/f_got/lib/python3.10/site-packages/transformers/models/auto/conf…

amannier updated 2 weeks ago
3
Ucas-HaoranWei/GOT-OCR2.0 #149

使用ms-swift微调时，https, left, transformers库的版本冲突

在使用ms-swift微调时，由于swift所使用的版本与模型本身所使用的版本不相同，会导致报错： ImportError: cannot import name 'log' from 'torch.distributed.elastic.agent.server.api'. 在安装ms-swift库时就有报错： ERROR: pip's dependency resolver does n…

HungryFlo updated 3 days ago
3
ml-explore/mlx-swift-examples #150

some models fail to prepare tokens: No chat template

``` Model loaded -> id("mlx-community/phi-2-hf-4bit-mlx") Error: chatTemplate("No chat template was specified") ``` For models that have a chat template this is fine, but for those that do not: …

davidkoski updated 1 week ago
3
huggingface/swift-transformers #4

Tokenizers: additional Normalizers, PreTokenizers, PostProce…

So far I've ported the components I needed to support the models I tested, but there are many more in `transformers` and `tokenizers`. For example: - https://github.com/huggingface/swift-transforme…

pcuenca updated 1 week ago
5
modelscope/ms-swift #2416

训练正常进行但保存检查点时出现 OOM

## 环境信息 - GPU：A100 - 显存：40G - SWIFT版本：v2.5.2 ## 训练脚本 ``` CUDA_VISIBLE_DEVICES=0 PYTORCH_CUDA_ALLOC_CONF="expandable_segments:True" swift sft \ --model_type llama3_2-11b-vision-instruct …

mirrorange updated 1 day ago
1
QwenLM/Qwen2-VL #84

transformers版本问题

pkg_resources.DistributionNotFound: The 'transformers=4.33' distribution was not found and is required by ms-swift 我用swift微调qwen2vl运行报了这个版本错误，但是我是用指南中的pip install git+https://github.com/huggingface/t…

GenuineWWD updated 2 months ago
1
huggingface/sam2-studio #28

Model download selector

We can use one of the snapshot download functions from https://github.com/huggingface/swift-transformers/blob/71963c36da21b29630ee43fa0d748f8f5b88fc33/Sources/Hub/HubApi.swift#L185

pcuenca updated 1 month ago
1
QwenLM/Qwen2-VL #104

微调报错：RuntimeError: CUDA error: too many resources requested …

``` 使用环境： torch==2.4 transformers==4.45.dev0 torchvision==0.19.0 4*V100 NVIDIA-SMI 535.154.05 Driver Version: 535.154.05 CUDA Version: 12.2 微调命令： CUDA_VISIBLE_DEVICES=0,1,…

xiajinxiong updated 5 days ago
1
modelscope/ms-swift #2391

Fine tuning stalling

I am attempting to use the fine tuning with my custom dataset, however the training percentage value keeps staying at 0% and not increasing at all, after 20h of running time: ``` Train: 0%| …

ep0p updated 4 days ago
3
modelscope/ms-swift #2246

Finetuning Qwen2VL yield error when enabling FlashAttention …

**Describe the bug** When using Flash Attention (--use-flash-attention true) to train Qwen2VL model with mixed data (both image and text data), the code will yield the following error ``` [rank0]: …

VietDunghacker updated 1 day ago
7

上一页 1...1 2 3 4 5 6 7...80 下一页

799 results for swift-transformers

799 results
for swift-transformers