MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone
Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained. How can this be resolved? #296

Zhaojjjjjj commented 1 week ago

Is there an existing issue / discussion for this?

Is there an existing answer for this in FAQ?

Current Behavior

% python3
Loading checkpoint shards: 100%|██████████| 7/7 [00:45<00:00,  6.57s/it]
Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.

Expected Behavior

Steps To Reproduce

import torch
from PIL import Image
from transformers import AutoModel, AutoTokenizer

model = AutoModel.from_pretrained('openbmb/MiniCPM-Llama3-V-2_5', trust_remote_code=True, torch_dtype=torch.float16)
# model =   (the right-hand side of this line is missing from the post)

tokenizer = AutoTokenizer.from_pretrained('openbmb/MiniCPM-Llama3-V-2_5', trust_remote_code=True)
model.eval()

image = Image.open('xx.jpg').convert('RGB')
question = 'What is in the image?'
msgs = [{'role': 'user', 'content': question}]

# Single-shot answer
res = model.chat(
    image=image,
    msgs=msgs,
    tokenizer=tokenizer,
    sampling=True,
    temperature=0.7,
)
print(res)

# Streaming answer: the result is iterated chunk by chunk
res = model.chat(
    image=image,
    msgs=msgs,
    tokenizer=tokenizer,
    sampling=True,
    temperature=0.7,
    stream=True
)

generated_text = ""
for new_text in res:
    generated_text += new_text
    print(new_text, flush=True, end='')

Environment

- OS: M2 Pro
- Python: 3.8
- Transformers: 4.31.0
- PyTorch: 2.0.1
- CUDA (`python -c 'import torch; print(torch.version.cuda)'`):

Anything else?
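
The "Special tokens have been added in the vocabulary" line in the output above appears to be a logged warning rather than an exception, since the checkpoint shards finish loading before it is printed. If the goal is only to hide it, one possible approach is to raise the log level of the library's logger. The following is a minimal sketch, assuming the message is emitted through the `transformers` logger hierarchy at WARNING level; `silence_library_warnings` is a hypothetical helper name, not part of any library.

```python
import logging


def silence_library_warnings(logger_name: str = "transformers") -> logging.Logger:
    """Raise the named logger's level so WARNING messages are dropped.

    Assumption: the library logs through standard Python logging under
    `logger_name`, and its child loggers inherit this level.
    """
    logger = logging.getLogger(logger_name)
    logger.setLevel(logging.ERROR)
    return logger


# Call this after `import transformers` and before `from_pretrained`,
# because the library may set its own default level when it is imported.
silence_library_warnings("transformers")
```

The library also ships its own helper for the same purpose, `transformers.logging.set_verbosity_error()`. Neither approach changes the model or tokenizer; it only hides the message.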

