-
### System Info
- `transformers` version: 4.37.0.dev0
- Platform: Linux-5.15.0-89-generic-x86_64-with-glibc2.31
- Python version: 3.10.11
- Huggingface_hub version: 0.19.4
- Safetensors version: …
-
Does this method implement data parallelism for both single-node and multi-node setups?
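As context for the single-node vs. multi-node question: data parallelism means each replica processes its own shard of the batch and the per-shard results are then reduced (averaged) across replicas. A minimal pure-Python sketch of that idea — all names here are illustrative, not any library's API, and a real implementation would use framework primitives such as all-reduce over GPUs:

```python
from concurrent.futures import ThreadPoolExecutor

def shard(batch, n):
    """Split a batch into n near-equal shards (one per data-parallel replica)."""
    k, r = divmod(len(batch), n)
    out, i = [], 0
    for j in range(n):
        size = k + (1 if j < r else 0)
        out.append(batch[i:i + size])
        i += size
    return out

def local_gradient(shard):
    # Stand-in for a per-replica forward/backward pass; here the
    # "gradient" is just the mean of the shard's values.
    return sum(shard) / len(shard)

def data_parallel_step(batch, n_workers=4):
    """Run one data-parallel step: shard, compute locally, then reduce."""
    shards = shard(batch, n_workers)
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        grads = list(pool.map(local_gradient, shards))
    # "All-reduce": average the local results, weighted by shard size,
    # so the outcome matches a single-worker pass over the whole batch.
    total = sum(g * len(s) for g, s in zip(grads, shards))
    return total / len(batch)

print(data_parallel_step([1.0, 2.0, 3.0, 4.0]))  # 2.5, same as the single-worker mean
```

Multi-node data parallelism follows the same shape, except the reduce step crosses machine boundaries (e.g. over NCCL or Gloo) instead of a thread pool.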
-
## 🚀 Feature
## The need
Models are getting bigger and there are times when loading all the params from external storage into CPU memory at once is either not possible or calls for some extra c…
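One common answer to this need is to keep weights on disk and materialize tensors one at a time on demand, rather than deserializing the whole checkpoint into CPU memory. A toy sketch of that pattern using only the standard library — the flat file layout below is invented for illustration; real formats such as safetensors differ in detail but rely on the same offset-table idea:

```python
import json
import mmap
import struct

# Hypothetical checkpoint layout (illustration only): an 8-byte header length,
# a JSON header mapping tensor name -> [byte offset, element count],
# then the raw float64 data for all tensors back to back.

def write_checkpoint(path, tensors):
    """Write a dict of name -> list[float] in the toy flat format."""
    header, blobs, offset = {}, [], 0
    for name, values in tensors.items():
        blob = struct.pack(f"{len(values)}d", *values)
        header[name] = [offset, len(values)]
        blobs.append(blob)
        offset += len(blob)
    header_bytes = json.dumps(header).encode()
    with open(path, "wb") as f:
        f.write(struct.pack("Q", len(header_bytes)))
        f.write(header_bytes)
        f.writelines(blobs)

def load_tensor(path, name):
    """Load one tensor lazily: only the header and the requested bytes are touched."""
    with open(path, "rb") as f:
        (hlen,) = struct.unpack("Q", f.read(8))
        header = json.loads(f.read(hlen))
        offset, count = header[name]
        data_start = 8 + hlen
        # mmap lets the OS page in just the slice we read.
        mm = mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ)
        try:
            start = data_start + offset
            return list(struct.unpack(f"{count}d", mm[start:start + 8 * count]))
        finally:
            mm.close()
```

For example, `write_checkpoint("ckpt.bin", {"w": [1.0, 2.0], "b": [3.0]})` followed by `load_tensor("ckpt.bin", "b")` returns `[3.0]` without ever unpacking `"w"`.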
-
I have just installed EKG and receive the following error when saving a new note:
⛔ Warning (llm): Open AI API is not free software, and your freedom to use it is restricted.
See https://openai.co…
-
### Your current environment
```text
$ python collect_env.py
Collecting environment information...
PyTorch version: 2.3.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used …
-
**Describe the bug**
I tried to use pipeline parallelism with transformers, but found that it gets stuck in the middle of execution.
**To Reproduce**
Steps to reproduce the behavior:
```
#!/usr/bin/env py…
-
**Issue Description:**
When I tried to deploy the llama-hf-65B model on an 8-GPU machine, I followed the example in Distributed Inference and Serving ([link](https://docs.vllm.ai/en/latest/serving/…
-
### Your current environment
```text
Collecting environment information...
PyTorch version: 2.2.1+cu118
Is debug build: False
CUDA used to build PyTorc…
```
-
**Describe the bug**
When running ee_inference_server.sh, I constantly received error messages like:
```
Traceback (most recent call last):
File "/data/EE-LLM/tools/run_early_exit_text_generatio…
-
Sufficient techniques for 2.4.1: Bypass Blocks are:
https://www.w3.org/WAI/WCAG22/Understanding/bypass-blocks#techniques
H69: Providing heading elements at the beginning of each section of conte…
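Technique H69 can be machine-checked: if every `<section>` of content opens with a heading element, assistive technology can jump between sections by heading. A small sketch of such a checker using the standard-library HTML parser — the sample page and checker class are illustrative, not part of any WCAG tooling:

```python
from html.parser import HTMLParser

# Illustrative page following technique H69: each content section
# begins with a heading element.
PAGE = """
<nav><a href="#main">Skip to main content</a></nav>
<section><h2>News</h2><p>...</p></section>
<section id="main"><h2>Main content</h2><p>...</p></section>
"""

class SectionHeadingChecker(HTMLParser):
    """Flag pages where the first element inside a <section> is not a heading."""
    def __init__(self):
        super().__init__()
        self.expect_heading = False
        self.ok = True

    def handle_starttag(self, tag, attrs):
        if tag == "section":
            # The very next element opened should be a heading.
            self.expect_heading = True
        elif self.expect_heading:
            if tag not in ("h1", "h2", "h3", "h4", "h5", "h6"):
                self.ok = False
            self.expect_heading = False

checker = SectionHeadingChecker()
checker.feed(PAGE)
print(checker.ok)  # True: every section opens with a heading
```

Feeding it a page such as `<section><p>no heading</p></section>` sets `ok` to `False`, signalling that H69 is not met for that section.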