Closed AshD closed 3 months ago
Perhaps I am gravely mistaken, but is there any chance the mistral json file that defines the mistral architecture has been modified on your end? self_attn.k_norm.weight
in addition to being an odd weight name doesn't exist here https://github.com/arcee-ai/mergekit/blob/main/mergekit/_data/architectures/mistral.json
Thanks. That was it.
Mergekit (8/18/24) : Trying to create a passthrough merge and it fails with this error RuntimeError: Tensor model.layers.86.self_attn.k_norm.weight required but not present in model mistralai/Mistral-Large-Instruct-2407
mergekit-config is
Output