-
Hi,
I want to continue training of the Bloom model. To start simple, I want to load the 1.1B model into the BigScience Megatron-DeepSpeed library.
I tried to run pretrain_gpt.py with the argum…
-
Hello, I installed your package using `setup/setup.sh`. The single-GPU command in the tutorial works fine, but when I run the multi-GPU command `deepspeed --num_gpus 8 --num_nodes 2 --master_addr mach…
-
Getting this error while pretraining LLama2 on A100 gpu. Using NCCL version 2.19.3. Running it on single vm with single A100 GPU.
Spotllm:73025:73025 [0] NCCL INFO Bootstrap : Using eth0:10.0.0.4
…
-
I got unsloth running in native windows, (no wsl). You need visual studio 2022 c++ compiler, triton, and deepspeed. I have a full tutorial on installing it, I would write it all here but I’m on mobile…
-
## 🚀 Feature
Add the ability to translate the following Collective Communication ops to native
XLA instructions:
* `all_gather`
* `reduce_scatter`
* `collective_permute`
* `send`
* `recv`…
-
### Feature request
Similar to how model parameter and optimizer offload is supported using the [deepspeed library](https://github.com/huggingface/transformers/blob/de13a951b38b85195984164819f1ab05…
-
### Reminder
- [X] I have read the README and searched the existing issues.
### System Info
ERROR: Cannot install llamafactory and llamafactory[metrics,torch-npu]==0.9.1.dev0 because these package …
-
susie.sun@yz-amd1:~$ docker run -it rocm/deepspeed:rocm5.7_ubuntu20.04_py3.9_pytorch_2.0.1_DeepSpeed /bin/bash
root@c50e90963e1a:/var/lib/jenkins# deepspeed --num_gpus 1 deploy.py
[2023-12-14 01:52:…
-
All other libraries for language models load the model in default model quantization unless explicitly specified. https://github.com/unslothai/unsloth/blob/27fa021a7bb959a53667dd4e7cdb9598c207aa0d/uns…
-
I get this error following the deepspeed-fastgen instructions:
```python
from mii import pipeline
pipe = pipeline("mistralai/Mistral-7B-v0.1")
```
The full stack trace is:
```
Loading ext…