Open abc5z7 opened 2 months ago
`self.layers = nn.ModuleList([create_block ...])`
`mixer_cls = partial(Mamba, layer_idx=layer_idx, bimamba_type=bimamba_type, if_devide_out=if_devide_out, init_layer_scale=init_layer_scale, **ssm_cfg, **factory_kwargs)`
`from mamba_ssm.modules.mamba_simple import Mamba`
The imported `Mamba` is the module that contains the SiLU implementation. You can find it here: https://github.com/hustvl/Vim/blob/main/mamba-1p1p1/mamba_ssm/modules/mamba_simple.py
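To illustrate why the activation never shows up in the file that builds the layers, here is a toy sketch in plain Python. It is not the Vim or `mamba_ssm` code: `ToyMixer`, its running-sum "scan", and the config values are all made up for illustration. The only point is that `partial(...)` just binds constructor arguments, while the activation is created inside the mixer class and can be reused by a forward and a backward branch.

```python
import math
from functools import partial
from itertools import accumulate


def silu(x: float) -> float:
    """SiLU (swish): x * sigmoid(x)."""
    return x / (1.0 + math.exp(-x))


class ToyMixer:
    """Hypothetical stand-in for a mixer class such as Mamba (NOT the real one)."""

    def __init__(self, d_model, layer_idx=None, bimamba_type="none", **extra):
        self.d_model = d_model
        self.layer_idx = layer_idx
        self.bimamba_type = bimamba_type
        self.extra = extra  # everything passed via **ssm_cfg / **factory_kwargs
        # The activation is created here, inside the mixer, so the file that
        # only builds the mixer through partial() never mentions it.
        self.act = silu

    def __call__(self, seq):
        # Toy forward branch: running sum over the sequence, then activation.
        fwd = [self.act(v) for v in accumulate(seq)]
        if self.bimamba_type == "none":
            return fwd
        # Toy backward branch: same running sum on the reversed sequence,
        # the SAME self.act, then flipped back to the original order.
        bwd = [self.act(v) for v in accumulate(reversed(seq))][::-1]
        return [f + b for f, b in zip(fwd, bwd)]


# Hypothetical config values, only here to show the ** unpacking.
ssm_cfg = {"d_state": 16}
factory_kwargs = {"device": None, "dtype": None}

# Same shape as the mixer_cls line quoted above: keyword arguments are bound
# now, and the remaining argument (d_model) is supplied when the block is built.
mixer_cls = partial(ToyMixer, layer_idx=0, bimamba_type="v2", **ssm_cfg, **factory_kwargs)
mixer = mixer_cls(4)
out = mixer([0.0, 1.0, -1.0])
```

In this sketch both branches call the one `self.act`, which is one way a single activation can serve both directions. Whether the real `Mamba` class does it the same way should be checked in `mamba_simple.py` at the link above.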
Thank you for the excellent work.
I have the same question. A single activation function controls both the forward and backward paths. How does directly importing Mamba achieve control of both paths? Could you point out the specific location in the code?
Thank you very much.
I don't see any `SiLU` function in vim/models_mamba.py, where it should be. Did I get the wrong file? Could anyone explain it? Thanks so much!