Open xiechengmude opened 7 months ago
It looks like some huggingface-specific arguments (bos_token_id) might be passed to the model automatically, but there's very little context here to understand what happens exactly. Could you maybe share some more code, specifically how you initialize the model?
I train the model via axolot .
Heres the chat.py error:
Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained. Traceback (most recent call last): File "/workspace/mamba-chat/xdan-chat.py", line 12, in
model = MambaLMHeadModel.from_pretrained(model_path, device="cuda", dtype=torch.float16)
File "/root/miniconda3/envs/axo/lib/python3.10/site-packages/mamba_ssm/models/mixer_seq_simple.py", line 231, in from_pretrained
model = cls(config, device=device, dtype=dtype, kwargs)
File "/root/miniconda3/envs/axo/lib/python3.10/site-packages/mamba_ssm/models/mixer_seq_simple.py", line 190, in init
self.backbone = MixerModel(
TypeError: MixerModel.init() got an unexpected keyword argument 'bos_token_id'