Some minor Mamba-2 fixes: previously there could be issues when used with peft, which should now be fixed. I also updated the Mamba-2 default state_size to 64, as set in https://github.com/state-spaces/mamba . Mamba-2 is more optimized in how it handles state, so its default state_size should be 64 or 128 rather than 16 as for Mamba-1.
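As a rough illustration of the new default, here is a minimal sketch of a version-dependent state_size. The `SSMConfig` class and the `DEFAULT_STATE_SIZE` table are hypothetical names made up for this example and are not this repository's actual config API; only the values 16 and 64 come from the note above.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical lookup table; the values follow the note above
# (16 for Mamba-1, 64 for Mamba-2). Not an API of any real library.
DEFAULT_STATE_SIZE = {"mamba1": 16, "mamba2": 64}


@dataclass
class SSMConfig:
    """Illustrative config: state_size falls back to a per-version default."""

    version: str = "mamba2"
    state_size: Optional[int] = None  # None means "use the version default"

    def __post_init__(self) -> None:
        if self.version not in DEFAULT_STATE_SIZE:
            raise ValueError(f"unknown version: {self.version!r}")
        if self.state_size is None:
            self.state_size = DEFAULT_STATE_SIZE[self.version]


if __name__ == "__main__":
    print(SSMConfig())                    # Mamba-2 with the default state size
    print(SSMConfig(version="mamba1"))    # Mamba-1 keeps the smaller default
    print(SSMConfig(state_size=128))      # an explicit value always wins
```

An explicit state_size (for example 128) overrides the default, so existing setups that pin the value would be unaffected under this assumption.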
Training benchmark output: