deepspeed-library Search Results

1000+ results for deepspeed-library

bigscience-workshop/Megatron-DeepSpeed #350

Load Bloom Optimizer State (i.e. Bloom 1B1)

Hi, I want to continue training of the Bloom model. To start simple, I want to load the 1.1B model into the BigScience Megatron-DeepSpeed library. I tried to run pretrain_gpt.py with the argum…

philippmtk updated 1 year ago, 2 comments

stanford-crfm/mistral #196

torch_extensions/py38_cu113/fused_adam/fused_adam.so: cannot…

Hello, I installed your package using `setup/setup.sh`. The single-GPU command in the tutorial works fine, but when I run the multi-GPU command `deepspeed --num_gpus 8 --num_nodes 2 --master_addr mach…

yandachen updated 1 year ago, 6 comments

NVIDIA/nccl #1240

ncclSystemError: System call (e.g. socket, malloc) or extern…

Getting this error while pretraining LLama2 on A100 gpu. Using NCCL version 2.19.3. Running it on single vm with single A100 GPU. Spotllm:73025:73025 [0] NCCL INFO Bootstrap : Using eth0:10.0.0.4 …

amitagh updated 7 months ago, 4 comments

unslothai/unsloth #210

I got unsloth running in native windows.

I got unsloth running in native windows, (no wsl). You need visual studio 2022 c++ compiler, triton, and deepspeed. I have a full tutorial on installing it, I would write it all here but I’m on mobile…

NicolasMejiaPetit updated 5 days ago, 19 comments

pytorch/xla #3138

[RFC] Exposing additional XLA collective communication primi…

## 🚀 Feature Add the ability to translate the following Collective Communication ops to native XLA instructions: * `all_gather` * `reduce_scatter` * `collective_permute` * `send` * `recv`…

hjm-aws updated 4 months ago, 16 comments

huggingface/transformers #28738

Any plans to support KV Cache offloading to CPU (and NVMe)?

### Feature request Similar to how model parameter and optimizer offload is supported using the [deepspeed library](https://github.com/huggingface/transformers/blob/de13a951b38b85195984164819f1ab05…

goelayu updated 7 months ago, 5 comments

hiyouga/LLaMA-Factory #5763

Huawei NPU adaptation: dependency conflict.

### Reminder - [X] I have read the README and searched the existing issues. ### System Info ERROR: Cannot install llamafactory and llamafactory[metrics,torch-npu]==0.9.1.dev0 because these package …

yangyang6666 updated 1 week ago, 8 comments

ROCm/DeepSpeed #68

[BUG] I have pulled the docker images,but when I run it ,I g…

susie.sun@yz-amd1:~$ docker run -it rocm/deepspeed:rocm5.7_ubuntu20.04_py3.9_pytorch_2.0.1_DeepSpeed /bin/bash root@c50e90963e1a:/var/lib/jenkins# deepspeed --num_gpus 1 deploy.py [2023-12-14 01:52:…

sunpian1 updated 6 months ago, 17 comments

unslothai/unsloth #588

load_in_4bit should be False by default.

All other libraries for language models load the model in default model quantization unless explicitly specified. https://github.com/unslothai/unsloth/blob/27fa021a7bb959a53667dd4e7cdb9598c207aa0d/uns…

ronakk-google updated 5 months ago, 3 comments

microsoft/DeepSpeed-MII #273

Unable to load ragged_device_ops op due to no compute capabi…

I get this error following the deepspeed-fastgen instructions:

```python
from mii import pipeline
pipe = pipeline("mistralai/Mistral-7B-v0.1")
```

The full stack trace is: Loading ext…

rogerbock updated 5 months ago, 10 comments
