-
Hello, I have been using unsloth for my fine-tuning purposes and am really enjoying the framework so far!
I just wanted to know if you could add support for loading and training state space models l…
-
Hi, I am benchmarking inference speed on long sequences and encountering CUDA-related errors specifically with the Mamba2 models at longer sequence lengths (>200k). This issue does not occur with Mamb…
-
Hello AnFreTh,
Thank you for your work on this project. I am currently using Mambular to process tabular data, but I am experiencing very slow training speeds. On average, each epoch is taking arou…
-
I have set up the environment successfully, but when I run `lm_eval --model mamba_ssm --model_args pretrained=state-spaces/mamba-130m --tasks lambada_openai,hellaswag,piqa,arc_easy,arc_challenge,winogra…
-
### Feature Description
Mistral has just released a new 7B coding model.
- **Blog Post**: https://mistral.ai/news/codestral-mamba/
- **HF**: https://huggingface.co/mistralai/mamba-codestral-7B…
-
### Python -VV
```shell
(codestral) ➜ dev python -VV
Python 3.10.14 (main, May 6 2024, 19:42:50) [GCC 11.2.0]
```
### Pip Freeze
```shell
(codestral) ➜ dev pip freeze
absl-py==2.1.0
addict==…
-
This page is accessible via [roadmap.vllm.ai](https://roadmap.vllm.ai)
### Themes
As before, we categorized our roadmap into 6 broad themes: broad model support, wide hardware coverage, state of…
-
I'm trying to train mamba2 130m from scratch.
```python
config = Mamba2Config(
    vocab_size=len(tokenizer.vocab),
    n_positions=10,
    n_embd=768,
    …
```
-
Hi!
I've been exploring the Mamba architecture with great interest, especially its computational efficiencies compared to traditional Transformer models. The selective state space approach and the …
-
Dear Mamba Contributors,
I hope this message finds you well. I am in the process of utilising the Mamba state space architecture for a language modelling task and have been highly impressed with th…