Mamba-Chat: A chat LLM based on the state-space model architecture 🐍
Apache License 2.0
911
stars
69
forks
source link
I downloaded the mamba-790m file from Hugging Face to my local machine for loading and training. However, I encountered an error during the loading process, like that "Missing key(s) in state_dict: "backbone.layers.0.mixer.A_b_log"" #30
can anybody help me! thanks