-
### System Info
- `transformers` version: 4.43.4
- Platform: Linux-4.9.151-015.ali3000.alios7.x86_64-x86_64-with-glibc2.17
- Python version: 3.8.18
- Huggingface_hub version: 0.24.6
- Safetensors…
-
### Please check that this issue hasn't been reported before.
- [X] I searched previous [Bug Reports](https://github.com/OpenAccess-AI-Collective/axolotl/labels/bug) didn't find any similar reports…
-
**Describe the bug**
FSDP CPU offloading (`model.fsdp=True` and `model.fsdp_cpu_offload=True`) raises errors due to disallowed device placements (see error and full traceback below). This behavior …
-
### Please check that this issue hasn't been reported before.
- [X] I searched previous [Bug Reports](https://github.com/OpenAccess-AI-Collective/axolotl/labels/bug) didn't find any similar reports…
-
### System Info
```Shell
- `Accelerate` version: 0.18.0
- Platform: Linux-5.4.0-124-generic-x86_64-with-glibc2.31
- Python version: 3.9.12
- PyTorch version (GPU?): 1.12.0 (True)
- `Accelerate` d…
-
### Please check that this issue hasn't been reported before.
- [X] I searched previous [Bug Reports](https://github.com/axolotl-ai-cloud/axolotl/labels/bug) didn't find any similar reports.
### Exp…
-
Been using Prodigy for a few days and honestly I'm very impressed by its performance. Especially, I can set a large learning rate (lr=1, d_coef=10) without blowing up the gradients. However, the final…
-
Hi thanks for this contribution
as a small exercise I am training SD2 on the pokemon dataset
I precomputed the latents and it starts training on one gpu
However at the evaluation time I get the fol…
-
### System Info
```Shell
Copy-and-paste the text below in your GitHub issue
- `Accelerate` version: 0.12.0
- Platform: Linux-5.4.0-105-generic-x86_64-with-debian-buster-sid
- Python version: …
-
https://github.com/shibing624/textgen/blob/0339b3ed20004a677eb1250ac53ff132fe64e9a0/examples/chatglm/training_chatglm_adgen_demo.py#L47
你好,麻烦请问一下,这里这样读取数据后,在 chatglm_model.py第243-245显示读取的数据为空,这里应该怎么理…