-
Hello!
I've encountered an issue trying to run DreamBooth training with DeepSpeed in kohya_ss.
I am running into an error, which seems to occur inside DeepSpeed's stage_1_and_2.py, line 50…
-
### Your current environment
The output of `python collect_env.py`
```text
Collecting environment information...
CPU:
Architecture: x86_64
CPU op-mode(s): …
-
### Question Validation
- [X] I have searched both the documentation and discord for an answer.
### Question
Hi, thank you for your nice work. I'm opening this question to ask about the availability of batch g…
-
**Describe the bug**
DeepSpeed loads the whole model onto every GPU.
When running Llama2-13b in full precision:
**To Reproduce**
I followed the tutorial in https://www.deepspeed.ai/tutorials/…
yunoJ updated
1 month ago
-
### Description
With t2t 1.6.6 and tensorflow 1.8.0, I ran cifar100 with eval early stopping. The command failed quickly with a crash at tensorboard/backend/event_processing/event_multiplexer.py, GetAccumulat…
-
### Description
Hi guys, I'm very excited about the recent activation offloading mechanism introduced in JAX/XLA:GPU, but I'm unable to make it work with scan.
My setup is the following - I'm train…
-
# 💻 cs
## 📚 mask (total: 9)
### 📃 Deep Pneumonia: Attention-Based Contrastive Learning for Class-Imbalanced Pneumonia Lesion Recognition in Chest X-rays
- **Authors:** Xinxu Wei, Haohan Bai, Xianshi …
-
I downloaded the new one and it would output garbage over and over. Figured it was the quant so I loaded the previous versions. All output repeating nonsense. Tried both textgen and tabbyAPI. Other mo…
-
Treating this as high priority, since it simplifies debugging multidevice code on GPUs.
We can use the virtual device abstraction even when kernels consume all Streaming Multiprocessors, to let the us…
-
### System Info
Hi all, I have been running benchmarks and testing myself to get a sense of what an ideal setup for deploying some models looks like, and in the past few months I've noticed an issue with Tens…
ghost updated
6 months ago