Open turian opened 11 months ago
We don't have that code in a releasable form. It's essentially the S4 model put inside the Mamba block, so it's pretty easy to implement.
@albertfgu One quick question, if you don't mind.
Is S4 available as a submodule in either the mamba or s4 repo? Thanks.
We aren't using submodules. You'll need to incorporate the code from the S4 repo separately.
@albertfgu Thank you for your work on Mamba. Could you elaborate a bit more on the architecture of the S4+Mamba block? I am trying to reproduce Mamba+S4 with a U-Net as described in the paper, but so far I only have a working forward pass for a SaShiMi-like model with just Mamba blocks. Any additional detail you can provide would be much appreciated!
Take the Mamba block and replace the S6 layer with S4.
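For anyone trying to reproduce this, here is a minimal PyTorch sketch of what "replace the S6 layer with S4" could look like. It is not the authors' code. It assumes the usual Mamba block layout (input projection, causal depthwise conv, SiLU, SSM, SiLU gate, output projection) and leaves out the surrounding norm and residual. `MambaStyleBlock` and `DiagonalLTISSM` are hypothetical names, and `DiagonalLTISSM` is only a simplified time-invariant stand-in for a real S4 layer.

```python
import math
import torch
import torch.nn as nn
import torch.nn.functional as F


class DiagonalLTISSM(nn.Module):
    """Hypothetical stand-in for S4: a diagonal, time-invariant SSM applied
    as a causal long convolution. NOT the S4 repo's implementation."""

    def __init__(self, d_inner, d_state=16):
        super().__init__()
        # Poles A = -exp(log_neg_A) are negative and real, so the SSM is stable.
        init = torch.log(torch.arange(1, d_state + 1).float())
        self.log_neg_A = nn.Parameter(init.repeat(d_inner, 1))
        self.log_dt = nn.Parameter(torch.full((d_inner,), math.log(0.01)))
        self.C = nn.Parameter(torch.randn(d_inner, d_state) / math.sqrt(d_state))
        self.D = nn.Parameter(torch.ones(d_inner))

    def forward(self, u):  # u: (batch, d_inner, length)
        L = u.shape[-1]
        dt = self.log_dt.exp().unsqueeze(-1)      # (d_inner, 1)
        A = -self.log_neg_A.exp()                 # (d_inner, d_state)
        dA = torch.exp(dt * A)                    # zero-order-hold discretization
        dB = (dA - 1.0) / A                       # ZOH input matrix, assuming B = 1
        t = torch.arange(L, device=u.device, dtype=u.dtype)
        powers = dA.unsqueeze(-1) ** t            # (d_inner, d_state, L)
        # Convolution kernel K[d, l] = sum_n C[d, n] * dB[d, n] * dA[d, n]^l
        K = torch.einsum("dn,dnl->dl", self.C * dB, powers)
        # Zero-pad to 2L so the FFT product is a linear (causal) convolution.
        u_f = torch.fft.rfft(u, n=2 * L)
        K_f = torch.fft.rfft(K, n=2 * L)
        y = torch.fft.irfft(u_f * K_f, n=2 * L)[..., :L]
        return y + u * self.D.unsqueeze(-1)       # skip connection


class MambaStyleBlock(nn.Module):
    """Mamba-style gated block with a pluggable SSM in place of S6."""

    def __init__(self, d_model, ssm_cls=DiagonalLTISSM, expand=2, d_conv=4):
        super().__init__()
        d_inner = expand * d_model
        self.in_proj = nn.Linear(d_model, 2 * d_inner, bias=False)
        self.conv1d = nn.Conv1d(d_inner, d_inner, d_conv,
                                groups=d_inner, padding=d_conv - 1)
        self.ssm = ssm_cls(d_inner)               # S6 would normally sit here
        self.out_proj = nn.Linear(d_inner, d_model, bias=False)

    def forward(self, x):  # x: (batch, length, d_model)
        L = x.shape[1]
        x, z = self.in_proj(x).transpose(1, 2).chunk(2, dim=1)
        x = F.silu(self.conv1d(x)[..., :L])       # trim right side to stay causal
        y = self.ssm(x) * F.silu(z)               # SSM branch gated by z branch
        return self.out_proj(y.transpose(1, 2))


if __name__ == "__main__":
    block = MambaStyleBlock(d_model=8)
    print(block(torch.randn(2, 32, 8)).shape)
```

To use the actual S4 layer, pass a different `ssm_cls`. This assumes it maps `(batch, channels, length)` to the same shape, so the S4 repo's layer may need a thin adapter to fit that interface.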
@albertfgu To my understanding, Mamba-S4 is this block with the S4Block as mixer. Is that correct?
Could you share code for Mamba-S4, the audio model for longform signals?