-
**Describe the bug**
I have two ubuntu machines, and with 10Gb/s erthnet cable connected and I want to use deepspeed to use these two machines to
run a model training with pipeline parallel, and …
-
### System Info
Package Version
------------------------ ---------------
accelerate 0.30.1
aiohttp 3.9.5
aiosignal 1.3.1
annotated-…
-
### System Info
- `transformers` version: 4.38.0.dev0
- Platform: Linux-5.4.0-169-generic-x86_64-with-glibc2.31
- Python version: 3.10.13
- Huggingface_hub version: 0.20.3
- Safetensors version…
-
### Checklist
- [X] The issue exists after disabling all extensions
- [X] The issue exists on a clean installation of webui
- [ ] The issue is caused by an extension, but I believe it is caused by a …
-
hi!
i am trying to use the dpo trainer to fine-tune a mixtral 8*7B model in 16bit precision (i've already completed fine-tuning for a 4bit model without issues, but unfortunately the quantized adap…
-
- Develop machine learning models for categorizing user responses into predefined categories or topics relevant to the research objectives.
- Implement sentiment analysis algorithms to automatically …
-
Subscribe to this issue and stay notified about new [daily trending repos in C#](https://github.com/trending/c%23?since=daily).
-
### Is there an existing issue for this?
- [X] I have searched the existing issues and checked the recent builds/commits
### What happened?
I installed the stable diffusion web ui and it worked fin…
-
I am facing an issue with the below configuration (was working yesterday and for the last week) where the model loads and dataset is tokenized but then the script hangs (GPU utilization spikes to 100%…
-
### Is there an existing issue for this?
- [X] I have searched the existing issues and checked the recent builds/commits
### What happened?
Autolaunch seems to be enabled by default now, but there …