-
### Feature request
Pytorch XLA/PJRT TPU support for bitsandbytes
### Motivation
Would allow for faster and more memory efficient training of models on TPUs.
### Your contribution
Happy to prov…
-
### Description
This kernel can run successfully:
```python
import functools
import jax
from jax.experimental import pallas as pl
import jax.numpy as jnp
@functools.partial(
pl.pallas_…
-
#1323 added single-device TPU Pods support. Multi-device TPU Pods have not been supported because running multi-node tasks on them may require changes to dstack.
Currently, dstack runs different jo…
-
Hi,
I've got a question here trying to setup alignment on spark tokens. The problem I am facing that all the tokens do align with each other, which is in some cases fine.
But I'd like that some…
-
(translate from korean with Claude)
# System Info
- Platform: Linux-6.5.0-45-generic-x86_64-with-glibc2.35
- Python version: 3.11.10
- PyTorch version: 2.4.1+cu124
- CUDA device(s): NVIDIA L40S…
-
### Description
Hi. I am extending the Pallas paged attention kernel. The case is a MQA. When I run my kernel, I encountered the following error which suggests it is an internal error and I should …
-
Hi-
I was wondering if it is possible to make these 2+1 tpus work in a pcie x8 with a splitter - wnated to confirm before buying:
stack would be:
- dual edge on this adapter: https://www.make…
-
### System Info
- `transformers` version: 4.45.0.dev0
- Platform: Linux-5.4.0-1043-gcp-x86_64-with-glibc2.31
- Python version: 3.10.14
- Huggingface_hub version: 0.24.6
- Safetensors version: 0.4…
-
### Steps to reproduce
```
type: service
# The name is optional, if not specified, generated randomly
name: llama31
# Using a Docker image with a fix instead of the official one
# More details…
-
Hi,
I am trying to combine both GSPMD + PyTorch Compile, but it doesn't work.
I took a copy of the test script "test_train_spmd_imagenet.py" and test it in colab, and it started normally. However,…