-
### Suggestion Description
Dear ROCm developers,
according to some tests I performed, Managed Memory was not really working in ROCm 5.x but it does work at least in ROCm 6.1.2. Is the XLA implemen…
-
**Describe the bug**
I am experiencing a CUDA out of memory error while training a Mamba 2.8b model with DeepSpeed using ZeRO 3. The issue occurs during the backward pass, and I have tried adjusting …
-
_ToDo: determine phd focus and scope_
Phd Funding project: https://www.tudelft.nl/en/2020/tu-delft/eur33m-research-funding-to-establish-trust-in-the-internet-economy
Duration: 1 Sep 2023 - 1 sep 2…
-
**LocalAI version:**
LocalAI version: v2.20.1 AIO-GPU-cuda12
**Environment, CPU architecture, OS, and Version:**
Linux dell4090 6.9.3-76060903-generic #202405300957~1721174657~22.04~…
-
Nice implementation!
I thought that Mamba was somewhat recurrent, like keeping an internal state and then outputting one token at a time. But your code shows that for each new output token, the ent…
-
When running
```
python benchmarks/benchmark_generation_mamba_simple.py --model-name "state -spaces/mamba-2.8b" --prompt "My cat wrote all this CUDA code for a new language model and" --topp 0.9 --…
-
### Package Name
altair, ibis-framework[duckdb], leafmap[libremap], myst
### Hub URL
nature.datahub.berkeley.edu
### Course Name
ESPM 157
### Semester Details
Fall 2024
### Ins…
-
**Describe the bug**
I am testing [demos for WH](https://github.com/tenstorrent/tt-metal/tree/main#wormhole-wh-models) on N150. And encountering errors.
**To Reproduce**
Steps to reproduce the b…
-
The following paper:
```
@misc{adams2024point2ssm,
title={Point2SSM++: Self-Supervised Learning of Anatomical Shape Models from Point Clouds},
author={Jadie Adams and Shireen Elhabian…
-
Hi, so I'm trying to make an equivalent example to the one presented on the README but for autoregressive generation.
Basically I want to make sure that inference step by step is the same as when …