-
Platforms: linux
This test was disabled because it is failing in CI. See [recent examples](https://hud.pytorch.org/flakytest?name=test_fsdp_unsupported_module_cls&suite=TestFSDPMiscMultiThread&limit=…
-
### 🐛 Describe the bug
to_sparse() is returning a FakeTensor where the indices attribute has the wrong shape/size.
### Error logs
_No response_
### Minified repro
Repro 1:
```python
import …
-
### 🚀 The feature, motivation and pitch
## Motivation: Limitation of Existing Profiling Approach
To conduct PyTorch distributed training performance analysis, currently a recommended way is profil…
wayi1 updated
2 weeks ago
-
### Please check that this issue hasn't been reported before.
- [X] I searched previous [Bug Reports](https://github.com/OpenAccess-AI-Collective/axolotl/labels/bug) didn't find any similar reports…
-
I use this setting below to train flux lora:
```
accelerate launch --gpu_ids 0,1 --main_process_port 29502 --mixed_precision bf16 --num_cpu_threads_per_process=2 \
flux_train_network.py --pr…
-
## 🚀 Feature
Thunder recipes and new high-level entrypoint.
This is important
- for new users, or users that just want to take advantage of thunder without getting into the how it works
- for …
-
We have discussed the following so far.
- Decide which domain
- Math([GSM8k](https://huggingface.co/datasets/gsm8k)), Code([Stack 2](https://huggingface.co/datasets/bigcode/the-stack-v2)), Gene…
-
Hi chaoyi,
Thanks for your great work. I have a question about dataset tokenization in the following code.
https://github.com/chaoyi-wu/Finetune_LLAMA/blob/1d4280e12f584b20cbb92a9f0dfe3a12a5de9…
-
Hello!
I was wondering if there was anything extra that needed to be done to get training with Hidet compiler working.
Out of the box I seem to be running into errors
```
import torch
from p…
-
Dear ray_lightning users and community members,
We wanted to share some important news regarding the future of the ray_lightning library. As you know, ray_lightning has been a valuable library for …