I guess some flash_attn has some problems. When I load models from huggingface: evo_model = Evo('evo-1-8k-base'), an error: self.inner_mha_cls = MHA( NameError: name 'MHA' is not defined or assert RotaryEmbedding is not None, "rotary_emb is not installed" AssertionError: rotary_emb is not installed occur.
My environment: flash_attn 2.5.6 Cuda compilation tools, release 11.6, V11.6.124 pytorch 1.13.1.
Anyone has solutions?
I guess some flash_attn has some problems. When I load models from huggingface:
evo_model = Evo('evo-1-8k-base')
, an error:self.inner_mha_cls = MHA( NameError: name 'MHA' is not defined
orassert RotaryEmbedding is not None, "rotary_emb is not installed" AssertionError: rotary_emb is not installed
occur. My environment: flash_attn 2.5.6 Cuda compilation tools, release 11.6, V11.6.124 pytorch 1.13.1. Anyone has solutions?