Closed jhuang265 closed 2 weeks ago
What is your mamba_ssm version? Are you able to load and evaluate this model outside of lm-eval-harness?
And does evaluating Mamba-2 models work for you? If they don't, for now we should pin the mamba_ssm version.
I'm using mamba_ssm version 2.2.2 (I installed it in editable form, but this was the latest release at the time I installed it).
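Since the install is editable, it may be worth confirming which version the environment actually reports before deciding whether to pin. Below is a minimal sketch using only the standard library. The helper names `installed_version` and `parse_release` are made up for illustration, and trying both the `mamba_ssm` and `mamba-ssm` spellings is an assumption about how the distribution name is registered.

```python
from __future__ import annotations

from importlib.metadata import PackageNotFoundError, version


def installed_version(package: str) -> str | None:
    """Return the version string the environment reports, or None if absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


def parse_release(version_string: str) -> tuple[int, ...]:
    """Turn '2.2.2' into (2, 2, 2), stopping at the first non-numeric part."""
    parts = []
    for piece in version_string.split("."):
        if not piece.isdigit():
            break
        parts.append(int(piece))
    return tuple(parts)


if __name__ == "__main__":
    # Assumption: the distribution is registered under one of these spellings.
    found = None
    for name in ("mamba_ssm", "mamba-ssm"):
        found = installed_version(name)
        if found is not None:
            break
    print("mamba_ssm:", found or "not installed")
    if found is not None:
        print("release tuple:", parse_release(found))
```

The release tuple compares with ordinary tuple ordering, which is enough for a quick check against a known-good version. It ignores suffixes such as `.post1` or `.dev0`.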
Evaluating all other mamba models works fine with the lm-evaluation-harness. I've been able to generate text with state-spaces/transformerpp-2.7b without the harness just fine (by explicitly calling model.generate(...)), so I assume there's some minor compatibility issue with this specific model that is related to how the harness calls the generation method.
Going to close this under the assumption that it appears to be something that needs handling on the mamba_ssm side of things, sorry!
There appears to be an issue with the state-spaces/transformerpp-2.7b model (in the mamba_ssm family of models) which causes a problem when generating (Running generate_until requests). This doesn't happen for Running loglikelihood requests, so I think there might be a specific issue that relates to the underlying calls. This doesn't happen for any other models with the mamba architecture.
The full stack trace is
Note that I installed both lm-eval and mamba as an editable module in a virtual environment.