Closed ZG-yuan closed 3 months ago
You need to change the template in conversation.py
replace the sep=""
as sep="<s>".
Otherwise, the mode will regard blank space as stop token.
Then everything goes well.
Strange bug solved 🕶️
@Rickylht Thank you very much!
Some weights of the model checkpoint at /mnt/disk/zgy/LLaVA-Med/LLaVA-Med-main/llava-med-v1.5-mistral-7b were not used when initializing LlavaMistralForCausalLM: ['model.vision_tower.vision_tower.vision_model.encoder.layers.19.mlp.fc1.weight',..........] USER: is heart CT? The attention mask and the pad token id were not set. As a consequence, you may observe unexpected behavior. Please pass your input's
attention_mask
to obtain reliable results. Settingpad_token_id
toeos_token_id
:2 for open-end generation. ASSISTANT: Yes USER: is leg CT? The attention mask and the pad token id were not set. As a consequence, you may observe unexpected behavior. Please pass your input'sattention_mask
to obtain reliable results. Settingpad_token_id
toeos_token_id
:2 for open-end generation. ASSISTANT: No USER:What is it? The attention mask and the pad token id were not set. As a consequence, you may observe unexpected behavior. Please pass your input'sattention_mask
to obtain reliable results. Settingpad_token_id
toeos_token_id
:2 for open-end generation. ASSISTANT: This