issues
search
state-spaces
/
mamba
Mamba SSM architecture
Apache License 2.0
12.7k
stars
1.06k
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
How to handle hidden state reset?
#577
Babylonehy
opened
8 hours ago
0
ImportError: /home/ubuntu/.local/lib/python3.10/site-packages/selective_scan_cuda.cpython-310-x86_64-linux-gnu.so: undefined symbol: _ZN2at4_ops10zeros_like4callERKNS_6TensorESt8optionalIN3c1010ScalarTypeEES5_INS6_6LayoutEES5_INS6_6DeviceEES5_IbES5_INS6_12MemoryFormatEE
#576
turian
closed
13 hours ago
4
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xb5 in position 1: invalid start byte
#575
AstroCIEL
opened
2 days ago
0
fixing softplus bug with _chunk_cumsum_bwd_kernel() triton kernel
#574
stephen-youn
closed
2 days ago
0
about reproducible
#573
Liujehong
opened
1 week ago
1
When I train with multi-GPU, the `autotuner.py` function in triton pops up `full_nargs = {**self.nargs, **kwargs, **self.best_config.kwargs} TypeError: 'NoneType' object is not a mapping` error
#572
sugardoll223
opened
1 week ago
0
Significant differences in gradients between `_ref` and `_fn` when using the complex formulation.
#571
karannb
opened
1 week ago
0
Some Questions about Mamba Block Training and modify
#570
xiaosa269
opened
1 week ago
0
mamba-ssm NO GRADIENT during training
#569
xiaxiaoguang
opened
1 week ago
0
Sequential Image Classification
#568
qifeng22
opened
1 week ago
1
Cuda 12.4 - ImportError: libcudart.so.12: cannot open shared object file: No such file or directory
#567
rozariwang
opened
1 week ago
0
LLVM Error when training mamba2
#566
teowenshen
closed
1 week ago
1
Assertion Failed in mamba2 Module During Training: "Unexpected mma -> mma layout conversion"
#565
xiaosa269
opened
2 weeks ago
0
Is the mixing of SSD layers with Attention supported in this codebase?
#564
MaximilienLeClei
closed
2 weeks ago
2
The huggingface-hub reportes an error
#563
wangyf8848
opened
2 weeks ago
0
How to change mamba to mamba2
#562
xiaogege1210
opened
2 weeks ago
2
Error when install from source on windows
#561
DStarEpoch
opened
2 weeks ago
0
[Deprecation] update deprecated pytorch `custom_fwd` and `custom_bwd` function from `torch.cuda.amp`
#560
Jonathanjordan21
closed
2 weeks ago
2
Hi, I wonder the FLOps of prim::PythonOp.MambaInnerFn, could you please provide the code ?
#559
924973292
opened
2 weeks ago
0
I found this line of code causes NaN of `dx`
#558
Liu-zhi-chao
closed
2 weeks ago
0
I'm wondering if this final_state refers to the state depicted by the red circle in the figure
#557
jialiangZ
opened
2 weeks ago
0
Can't train mamba2 from scratch with HF Trainer
#556
npkanaka
opened
3 weeks ago
19
fail
#555
JoyceMind
opened
3 weeks ago
0
update to triton 3.0.0
#554
johnnynunez
closed
2 weeks ago
0
mamba2_simple has no step funcion
#552
qmpzzpmq
opened
3 weeks ago
0
Possible bug in computation of chunk_scan and chunk_state
#551
vidavakil
closed
2 weeks ago
4
FP8 kernels?
#550
johnnynunez
opened
3 weeks ago
0
Mamba2 Error subprocess.CalledProcessError: Command [] returned non-zero exit status 1.
#549
two-tiger
opened
3 weeks ago
0
The problem of Mamba block input consistency but output fluctuation in model validation environment
#548
xypjq
opened
3 weeks ago
4
Fix Consider group_size in layer_norm_bwd
#547
ilyasch2
closed
2 days ago
0
How does the code run on the Macbook
#546
524125153
opened
1 month ago
2
Some questions about different implementations of the SSD algorithm
#545
chairman-lu
opened
1 month ago
2
AMD ROCm Autotrain failed due to ImportError: libc10_cuda.so: cannot open shared object file: No such file or directory
#544
unclemusclez
opened
1 month ago
0
Feat: Add the support for non-learnable RMS norm for large-scale training in `mamba_inner_fn`
#543
younesbelkada
opened
1 month ago
2
Clarifying no build isolation instructions
#542
amoskvic
opened
1 month ago
0
How to get all hidden_states of selective_scan_cuda?
#541
0205090923
opened
1 month ago
2
Mamba.__init__() got an unexpected keyword argument 'layer_idx'
#540
Shadow581
opened
1 month ago
3
OSError: We couldn't connect to 'https://huggingface.co' to load this file, couldn't find it in the cached files and it looks like state-spaces is not the path to a directory containing a file named config.json. Checkout your internet connection or see how to run the library in offline mode at 'https://huggingface.co/docs/transformers/installation#offline-mode'.
#539
xsa12345
opened
1 month ago
1
Datasets error
#538
xsa12345
opened
1 month ago
0
Fix Incorrect Gradients and Illegal Memory Access Error in Mamba2
#537
Hprairie
opened
1 month ago
4
Chunked inference
#536
yhv-wt
opened
1 month ago
1
Issue about the FLOPs of selective scan
#535
Aristo23333
opened
1 month ago
0
ERROR: Failed building wheel for mamba-ssm
#534
yojeep
closed
1 month ago
2
ModuleNotFoundError: No module named 'mamba_ssm.ops.triton.ssd_combined
#533
bkffadia
opened
1 month ago
0
Understanding about the selective scan
#532
Aristo23333
opened
1 month ago
4
Question about d_state.
#531
CacatuaAlan
opened
1 month ago
1
Optimizing the bwd pass of Mamba 2
#530
Hprairie
closed
1 month ago
3
Gradient explosion in Mamba2 training, norm and loss divergence
#529
edwko
opened
1 month ago
3
Results vary greatly across experiments
#528
William-HYWu
opened
1 month ago
0
Vanishing gradient problem with more layer
#527
yjdy
opened
1 month ago
2
Next