-
**Describe the bug**
What the bug is, and how to reproduce it, preferably with screenshots
![image](https://github.com/modelscope/swift/assets/77217949/fbaf7a1f-9270-4c32-bcae-56938fc7c5f4)…
-
Hi, when fine-tuning the Florence model, it forgets its old knowledge (we do not use Florence directly; we train it and then adopt its vision encoder into a larger LLM instead)…
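As a rough illustration of the setup described above, a minimal sketch of reusing only the fine-tuned vision encoder could look like the following; the checkpoint path and the `vision_tower` attribute name are assumptions, not details from the report:

```python
# Sketch: load a fine-tuned Florence checkpoint and detach its vision encoder so it can
# later be attached to a larger LLM. Path and attribute name are illustrative assumptions.
import torch
from transformers import AutoModelForCausalLM

florence = AutoModelForCausalLM.from_pretrained(
    "output/florence-ft/checkpoint-final",  # hypothetical local checkpoint
    trust_remote_code=True,
    torch_dtype=torch.bfloat16,
)

vision_encoder = florence.vision_tower  # attribute name is an assumption
torch.save(vision_encoder.state_dict(), "florence_vision_encoder.pt")
```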
-
release date:
- postponed until further notice, as anti-virus software is unhappy with builds starting from 2024-04 b6 as of September 1st
- can't continue until an eventual clear-up, and all associated downloads have been removed…
-
### Describe the bug
Upgrading diffusers to 0.29.0.dev0 did not solve the problem.
### Reproduction
```
pip install diffusers
from diffusers import StableDiffusionPipeline
pipe = StableDiffusionPipe…
```
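The reproduction above is cut off; a minimal complete version would look roughly like the sketch below, where the checkpoint id and dtype are assumptions rather than values from the original report:

```python
# Minimal sketch of a StableDiffusionPipeline run; checkpoint id and dtype are assumed.
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5",  # assumed checkpoint, not from the issue
    torch_dtype=torch.float16,
)
pipe = pipe.to("cuda")

image = pipe("a photo of an astronaut riding a horse").images[0]
image.save("out.png")
```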
-
RuntimeError: Failed to import swift.tuners.base because of the following error (look up to see its traceback): No module named 'transformers.models.gemma'
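This import error usually indicates that the installed transformers predates Gemma support (added around transformers 4.38), so swift's import of `transformers.models.gemma` fails. A quick check, as a sketch:

```python
# Check whether the installed transformers ships the gemma sub-package.
import importlib.util
import transformers

print("transformers version:", transformers.__version__)
print("gemma available:", importlib.util.find_spec("transformers.models.gemma") is not None)
# If gemma is missing, upgrading usually resolves the swift import error:
#   pip install -U transformers
```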
-
```python
import os
os.environ['CUDA_VISIBLE_DEVICES'] = '3'
from modelscope import Model, AutoModelForSequenceClassification, AutoTokenizer, MsDataset
from swift import Swift, LoRAConfig, AdapterConfig, Tr…
```
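The snippet is cut off at the import line; a hedged sketch of how such a script typically continues, assuming the `Swift.prepare_model` tuner API and an illustrative ModelScope checkpoint, is:

```python
# Sketch: wrap a ModelScope model with a LoRA tuner via swift. The checkpoint id,
# target module names, and LoRA hyperparameters are assumptions, not from the issue.
import os
os.environ['CUDA_VISIBLE_DEVICES'] = '3'

from modelscope import AutoModelForSequenceClassification, AutoTokenizer
from swift import Swift, LoRAConfig

model_id = 'damo/nlp_structbert_sentence-similarity_chinese-base'  # hypothetical checkpoint
model = AutoModelForSequenceClassification.from_pretrained(model_id)
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Attach LoRA adapters to the attention projections (module names are assumptions).
lora_config = LoRAConfig(r=8, lora_alpha=32, target_modules=['query', 'key', 'value'])
model = Swift.prepare_model(model, lora_config)
print(model)
```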
-
**Describe the bug**
What the bug is, and how to reproduce it, preferably with screenshots
After upgrading vllm from 0.3.1 to 0.4.0 and deploying the model with swift, request latency becomes noticeably longer (more than 2x) for the same model and the same prompt; no server deployment command parameters were changed.
C…
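To quantify the slowdown, one rough approach is to time identical requests against the deployed OpenAI-compatible endpoint under each vllm version; the URL, port, and model name below are assumptions for illustration:

```python
# Rough latency probe: send the same prompt repeatedly and time each response.
import time
import requests

URL = "http://127.0.0.1:8000/v1/chat/completions"  # assumed endpoint of the swift deployment
payload = {
    "model": "qwen1half-7b-chat",  # hypothetical model name
    "messages": [{"role": "user", "content": "Hello, how are you?"}],
    "max_tokens": 128,
}

latencies = []
for _ in range(5):
    start = time.perf_counter()
    resp = requests.post(URL, json=payload, timeout=300)
    resp.raise_for_status()
    latencies.append(time.perf_counter() - start)

print("per-request latency (s):", [round(t, 2) for t in latencies])
print("mean latency (s):", round(sum(latencies) / len(latencies), 2))
```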
-
Model: Qwen1.5-110B-Chat-AWQ
Command:
```
CUDA_VISIBLE_DEVICES=1,6 swift infer --model_type qwen1half-110b-chat-awq --infer_backend vllm --max_model_len 8192 --model_id_or_path /share/models/Qwen1.5-110B-Cha…
```
-
```
CUDA_VISIBLE_DEVICES=0 \
NPROC_PER_NODE=1 \
nproc_per_node=1 \
swift infer \
--ckpt_dir "output_llava/llava1d6-mistral-7b-instruct/v32-20240524-165418/checkpoint-2003" \
--custom_val_dataset_path…
```
-
RuntimeError: Input type (torch.cuda.ByteTensor) and weight type (CUDABFloat16Type) should be the same
Details:
```
Traceback (most recent call last):
  File "/datadisk/ai-prj/swift/examples…
```
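This error means the image tensor is still uint8 (`torch.cuda.ByteTensor`) while the model weights are bfloat16. The usual fix, sketched below with hypothetical names, is to scale and cast the input to the weight dtype before the forward pass:

```python
# Sketch of the typical fix for the dtype mismatch above. Variable and function names
# are hypothetical; adapt them to the actual preprocessing code.
import torch

def to_model_dtype(pixel_values: torch.Tensor, model: torch.nn.Module) -> torch.Tensor:
    """Cast uint8 image tensors to the model's parameter dtype (e.g. bfloat16)."""
    target_dtype = next(model.parameters()).dtype
    if pixel_values.dtype == torch.uint8:
        # Scale 0-255 pixel values into 0-1 floats before casting.
        pixel_values = pixel_values.float() / 255.0
    return pixel_values.to(target_dtype)
```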