-
Hello AnFreTh,
Thank you for your work on this project. I am currently using Mambular to process tabular data, but I am experiencing very slow training speeds. On average, each epoch is taking arou…
-
### 需求描述 Feature Description
任务目标(请描述你正在做的项目是什么,如模型、论文、项目是什么?); 需求场景(请描述你的项目中为什么需要用此功能); 功能描述(请简单描述或设计这个功能)
### 替代实现 Alternatives
希望能出一个paddle的Mamba,最近新模型很多都是基于Mamba的,没有Paddle-Mamba,只能用torch,非常不…
-
[mamba-codestral-7B-v0.1](https://huggingface.co/mistralai/mamba-codestral-7B-v0.1)
-
# Mamba: Selective State Space Modeling | Nathan's Notes
An introduction to Mamba models: faster and better* than transformers
[https://nathanzhao.cc/mamba](https://nathanzhao.cc/mamba)
-
请问用的mamba_ssm包是哪个版本的呀
-
你好,您提供的方案很有帮助,但有个问题想请教。Mamba文件中没有requirement.txt,直接运行pip install . 也出现报错。请问,requirement.txt的内容是什么呢?除了安装triton外,是否还需要安装其他环境?谢谢!
-
what caused this error?
![image](https://github.com/user-attachments/assets/28d42d70-7e78-4813-a270-0ee8d14baeee)
-
**How to customise the train.sh for a distributed Mamba Training ?**
Hello,
As i've seen in the megatron modules, there isn't a pre-defined bash script to pre-train a mamba model on multi-gpu, ho…
-
would be really awesome to get this model running locally
- [blog](https://mistral.ai/news/codestral-mamba/) on codestral mamba
- mamba 1 [paper](https://arxiv.org/pdf/2312.00752)
- mamba 2 [pape…
-
Can I fine-tune the latest Mamba architecture model using LLaMA-Factory?