-
Example command:
```
python benchmark_throughput.py --model gpt2 --input-len 256 --output-len 256
```
Output:
```
Namespace(backend='vllm', dataset=None, input_len=256, output_len=256, model='gpt…
```
-
Ray (https://github.com/ray-project/ray) has become a popular choice for running distributed Python ML applications. Its Python interface makes it easy to scale a workload from a local laptop to a distributed cl…
-
Case: BigDL/python/llm/example/GPU/Deepspeed-AutoTP
Model: Llama-2-7b-hf
ARC770: 2 cards
env: RPL RVP, ubuntu22.04, kernel-6.4.1, mem-32G
oneAPI 23.2.0
Running result:
```
(llm_multi) intel@ub…
-
Hello,
I'm currently training LLaMA PRO. Initially, I expanded the model from 32 layers to 40 layers and trained only the 8 newly added layers (every fifth layer). Therefore, I froze 32 …
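A minimal PyTorch sketch of that freezing scheme (the `nn.Linear` stand-ins and the index arithmetic are illustrative, not the actual LLaMA PRO modules):

```python
import torch.nn as nn

# Toy stand-in for the 40-layer expanded model (hypothetical layout:
# every fifth slot holds a newly inserted block).
model = nn.ModuleList([nn.Linear(8, 8) for _ in range(40)])

# Freeze every parameter first...
for p in model.parameters():
    p.requires_grad = False

# ...then unfreeze every fifth layer, i.e. the 8 newly added blocks
# at indices 4, 9, 14, ..., 39 in this toy layout.
for i, layer in enumerate(model):
    if (i + 1) % 5 == 0:
        for p in layer.parameters():
            p.requires_grad = True

trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
```

Only the unfrozen parameters would then be passed to the optimizer.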
-
### Your current environment
```text
The output of `python collect_env.py`
PyTorch version: 2.3.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
…
-
### System Info
- CPU: i9 9900k
- GPU: RTX 4090
- TensorRT-LLM Version: 0.9.0.dev2024022000
- Cuda Version: Cuda 12.3
- Driver Version: 545.29.06
- OS: Arch Linux, kernel version 6.7.5
### …
-
```python
def split_dict_equally(input_dict, chunks=8):
# A list of dictionaries to hold the split dictionary
split_dicts = [{} for _ in range(chunks)]
# Get all the keys from the inpu…
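# The body above is cut off; a complete version of this helper might look
# like the following sketch (round-robin key assignment is an assumption
# about the original intent):
def split_dict_equally_sketch(input_dict, chunks=8):
    # One empty dict per chunk
    split_dicts = [{} for _ in range(chunks)]
    # Deal keys out round-robin so the chunks end up near-equal in size
    for i, (key, value) in enumerate(input_dict.items()):
        split_dicts[i % chunks][key] = value
    return split_dicts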
-
### Your current environment
```text
Collecting environment information...
PyTorch version: N/A
Is debug build: N/A
CUDA used to build PyTorch: N/A
ROCM used to build PyTorch: N/A
OS: Debia…
-
```
Unsloth: Offloading input_embeddings to disk to save VRAM
Unsloth: Offloading input_embeddings to disk to save VRAM
Traceback (most recent call last):
File "/data/llmodel/Tools/software_inst…
-
When I use Qwen/Qwen-VL-Chat, it throws an error and I do not know why:
```
Traceback (most recent call last):
  File "test.py", line 20, in
    model = LLM(model=model_path, tokenizer=model_path,tokeni…
```